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Abstract 



This paper studies the multicast routing and admission control problem 
on unit-capacity tree and mesh topologies in the throughput-model. The 
problem is a generalization of the edge-disjoint paths problem and is NP- 
hard both on trees and meshes. 

We study both the offline and the online version of the problem: In the 
offline setting, we give the first constant-factor approximation algorithm for 
trees, and an (9((loglog«) 2 )-factor approximation algorithm for meshes. 

In the online setting, we give the first polylogarithmic competitive online 
algorithm for tree and mesh topologies. No polylogarithmic-competitive al- 
gorithm is possible on general network topologies [8] and there exists a poly- 
logarithmic lower bound on the competitive ratio of any online algorithm on 
tree topologies [3]. We prove the same lower bound for meshes. 

1 Introduction 

Multicast routing and admission control are the basic operations required by future 
high-speed communication networks that use bandwidth-reservation for quality- 
of-service guarantees. A number of applications from collective communication to 
data distribution will be based on efficient multicast communication. 

Formally, the multicast routing and admission control problem with M multi- 
casts consists of an rc-node graph G and a sequence or set of requests (t, Si), where 
the request node t and the source node Sj are nodes in G and i e {1,2,..., M). 
Multicast i consists of all requests with source Sj . For each request the algorithm 
has to decide whether to accept or reject it. If request (t, s,) is accepted, the algo- 
rithm has to connect node t to the multicast tree connecting the already accepted 
requests of multicast i with source s,-. In the unit-capacity setting, each link can be 
assigned to only one multicast tree: the trees spanning different multicasts must be 
edge-disjoint. The objective function is to maximize the total number of accepted 
requests. In the online version the requests form a sequence and when processing 
a request, the algorithm must decide without knowledge of future requests. In the 
offline version the requests form a set, which is given before the algorithm decides 
which requests to accept. 

Online multicast routing was recently studied under the small bandwidth as- 
sumption that the link bandwidth required by every connection is at most a frac- 
tion logarithmic in the size of the network. Awerbuch and Singh [7] gave an 
O (log n (log n + log log .M) log M) -competitive algorithm for the case in which 
all the requests to a given multicast arrive before the next multicast is created. 
Goel, Henzinger, and Plotkin [12] extended the study to the case in which requests 
to different multicasts can be interleaved. 
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With the sizes of networks growing faster than the link capacity, the small 
bandwidth request assumption is not necessarily a realistic assumption. There are 
many applications, for instance a multimedia server managed by a supercomputer, 
in which large amount of data must be transferred in a local network where a single 
communication path consumes a large fraction of the available bandwidth on a link 
[6]. Thus, the situation where the bandwidth required by a connection is a large 
fraction of the link capacity needs to be studied as well for the multicast routing 
problem. In this paper we take a first step into this direction by assuming that every 
connection uses the total bandwidth on a link. We call this the unit-capacity case. 

This paper studies both the offline and the online version of the multicast rout- 
ing and admission control problem in unit-capacity graphs. The offline problem 
models the case of arrival of a batch of connection requests to several multicasts. It 
is also motivated by all those situations where the answer to the user can be delayed 
for a limited time while other requests are collected. 

We present algorithms for tree and mesh topologies, which are at the basis 
of many communication networks. Trees are important practical network topolo- 
gies [5, 6, 21, 17], they are at the basis of topologies for communication networks 
such as trees of rings, often considered as interconnection of SONET rings optical 
networks [21, 17], or topologies for connecting high performance multicomputers 
systems as trees of meshes [6] and fat trees. The multicast routing problem on 
trees, when all the multicast groups use the same spanning tree, is then a basic 
problem to solve in this context. There has also been an extensive study of the uni- 
cast problem on these network topologies motivated by virtual circuit assignment 
and optical communication. Meshes topologies are often the basis of the inter- 
connecting topology of high performance multiprocessor systems. They are also 
relevant as a first approximation of nearly -planar communication networks [14]. 
The offline problem on meshes arises also in FPGA-routing, where various sub- 
sets of components have to be connected by trees such that the trees of different 
subsets do not overlap and the underlying routing fabric is a mesh. The unicast 
problem for meshes was recently studied in both the offline and the online version 
(see e.g. [6, 14, 19, 16]). 

Note that the multicast routing problem in unit-capacity graphs reduces to the 
edge-disjoint paths problem if only one request is presented for each multicast, also 
called the unicast setting. Multicast routing is also an interesting extension of the 
maximum coverage problem [13]. 

Previous work on unit-capacity networks. All previous work on unit-capacity 
networks studied unicast routing. Unlike multicast routing, the offline unicast prob- 
lem is still polynomial on trees [11], but it is NP-hard on meshes. Kleinberg and 
Tardos [14] proposed the first constant approximation algorithm for edge-disjoint 
paths on meshes and on a class of planar graphs called "densely embedded, nearly- 
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Eulerian graphs". They formulated the escape problem as an interesting subprob- 
lem and gave a constant-factor approximation for the escape problem in the unicast 
setting. A straightforward extension of their approach leads to a O (log n) -factor 
approximation for the escape problem for multicasts. We use instead a recursive 
approach that achieves a 0((loglog«) 2 )-factor approximation. 

For the online problem no algorithm, not even a randomized one, has a poly- 
logarithmic competitive ratio for any network topology [8]. This holds even in 
the unicast setting. Therefore in the unicast setting restricted graph topologies like 
trees, meshes, and "densely embedded, nearly-Eulerian graphs" [5, 6, 14, 16] were 
studied before and algorithms with logarithmic competitive ratio were proposed 
for all these network topologies. 

Our offline results. The problem on trees contains the MAX-3SAT problem 
and is thus MAX-SNP hard [1]. We present a polynomial time approximation al- 
gorithm for unit-capacity trees that achieves an approximation ratio of 18. The 
algorithm presents an interesting version of a greedy strategy. Each step schedules 
the "densest residual subtree" for a multicast and discards the overlapping subtrees 
of different multicasts already selected. The "densest residual subtree" of a multi- 
cast is the subtree maximizing the ratio between a value related to the net increase 
of the objective function after the selection, and a weight associated with the sub- 
tree itself. The algorithm can be easily implemented using a dynamic programming 
approach. To the best of our knowledge no approximation algorithm was known 
for this problem before. 

We also present the first approximation algorithm on unit-capacity meshes. Our 
polynomial time algorithm obtains an approximation ratio of O ((log log n) 2 ). It 
formulates the multicast routing problem as a fractional packing problem [10, 18, 
23] which is solved using duality-based algorithms. The fractional solution is then 
rounded probabilistically, leading to a potentially infeasible set of multicast trees, 
which are used to guide the construction of an integral solution. 

Our online results. An online algorithm A for the multicast routing problem 
has competitive ratio c if on any input sequence the ratio of the number of requests 
accepted by the optimum offline algorithm to the expected number of requests 
accepted by A is at most c. 

This paper shows how to achieve polylogarithmic-competitive algorithms for 
restricted topologies in the multicast setting: For trees we present an O (log n (log n+ 
log log M) log M) -competitive multicast algorithm and for meshes we give an 
0(log ?i(logra + log log .M) log A4) -competitive multicast algorithm. We also 
show a randomized lower bound of Q((logn log M)/d) for a connected graph 
with minimum degree d. This gives a lower bound of Q (log n log M.) for meshes. 
The same lower bound for trees follows from [3]. No competitive multicast al- 
gorithms were known for these topologies before. There are various difficulties 
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that multicast algorithms face over unicast algorithms. One of them is that latter 
multicasts might be more profitable than earlier ones. Thus, our algorithms accept 
each multicast that pass an initial screening for "mutability" with roughly equal 
probability. 

Section 2 of this paper presents the constant approximation algorithm on trees, 
Section 3 contains the online algorithm for trees, Section 4 gives the offline algo- 
rithm on meshes, Section 5 presents the online algorithm on meshes. 

2 The offline algorithm for trees 

We present a constant-factor approximation algorithm on trees. To denote the i-th 
multicast whose request node set is V we use the pair (i, V). A submulticast (i, V) 
of (i, V) is a multicast with source Si and request node set V C V. Our approach 
is to use a greedy algorithm that maintains an initially empty set S of (potentially) 
accepted submulticasts and assigns a weight and a residual profit to each submulti- 
cast. The algorithm repeatedly adds to S the submulticast that maximizes the ratio 
of its residual profit to its weight. Since the algorithm is offline, it can first accept 
a submulticast and then later add or subtract from it. We indicate this by saying 
that (i, V) is added to or removed from the current set S of submulticasts. Two 
submulticasts (i, V) and (/', V') overlap if they share an edge. We only add (i, V) 
to S if its profit is significantly larger than the profit lost by submulticasts which 
overlap with (i, V). 

We root the tree T at an arbitrary leaf. This defines an ancestor-descendant 
relation on the nodes of the tree. Let T(i, V) be the tree connecting the nodes of 
V to the source of i. The highest node of T(i, V) is called the root root(i, V) of 
(i, V). Note that the root does not have to belong to V. We say r is a subroot 
of (/, V) if r is the root of one of the submulticasts of (/, V). For each subroot 
r we say (i, V) is the maximum submulticast max(i, r) of (/, V) if (/, V) is the 
submulticast of (i, V) with root r that has the maximum number of requests. 

Next we define a weight for each multicast such that multicasts "higher" in 
the tree have higher weight and hence are added to S "later", except if they are 
very profitable. Given a submulticast (i, V) with root r and a multicast (/', V) 
with i' ^ i, let R(i', i, r) be the set of subroots r' of (i', V') such that r' is a true 
descendant of r and max(i', r') overlaps with max(i, r). For each multicast (i, V) 
and each possible root position r, we define the weight w(i, r) to be 



For all multicasts (i, V) and (/', V') with i ^ V and all subroots r, max(i, r) 




5 



and R(i', i, r) can be computed in polynomial time. Thus, w(i, r) can be computed 
in polynomial time by a bottom-up traversal of the tree. 

The profit p(i, V) of a submulticast (i, V) is the number of requests in (i, V). 
For i f the overlapping profit p(i, V, i', V) of submulticast (i, V) and (/', V') 
is defined to be the profit of the maximum submulticast of (/', V) whose requests 
cannot be accepted if (i, V) is accepted, i.e., the number of requests of (/', V) 
that cannot be accepted in (/, V) is accepted. For i — i' the overlapping profit 
p(i, V, V , V) of submulticast (i, V) and (/', V) is defined to be the profit of 
(/', V n V). Note that in general p(i, V, i' , V) # p(i', V , i, V). 

Let 0(i, V) be the set of submulticasts overlapping with (/, V). For a submul- 
ticast (i, V) the residual profit 

PresH, V) = P (i, V)-a J2 PQ> V ' ? "' 

(i',V')eSnO(i,V) 

where a > 1 is a constant to be chosen later. Let the ratio r(i, V) of a submulti- 
cast be defined to be p res (i, V)/w(i, r), where r = root(i, V). Now the greedy 
algorithm works as follows: 

(1) S - 0 

(2) for each submulticast (i, V): the residual profit p res (i, V) — p(i, V) 

(3) while there exists a submulticast not in 5 with positive residual profit: 

(4) Let (i, V) be a submulticast with maximum r(i, V) of all submulticasts not in S. 

(5) Let S de i = {(/', V"), (i', V") is the maximum submulticast of (/', V) e S 

whose requests cannot be accepted together with (i, V)} 

(6) S - S U (i, V) \ S del 

(7) Update the residual profit for each submulticast. 

Let the profit p(S) of set S of submulticasts be v)es P(*> ^ ( l '> ^) ^ s 
added to 5, then 

^ p(i,V,i' ',V') = p(Sdei). 
(i',V')eSr\0(i,V) 

Thus, the residual profit of a submulticast compares its profit with the profit lost 
from S if the submulticast is added to S. We first show that the algorithm termi- 
nates. 

Lemma 1 The algorithm terminates after at most nM. iterations. 

Proof. Whenever a multicast (i, V) is added to S and a set S^ei of submulticasts is 
deleted, p res (i) > 0. It follows that p(i, V) > ap(Sdei)- Thus, p(S) increases in 
each iteration by at least 1. The maximum value it can assume is nM. ■ 
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We prove in the next subsection that this algorithm gives a constant factor ap- 
proximation of the optimum solution. In the following subsection we show how to 
implement each iteration in polynomial time. 



2.1 Proof of the constant approximation ratio 

To prove that this algorithm gives a constant factor approximation we distinguish 
three types of overlaps: If T(i, V) contains an edge incident to the root(i' ', V) then 
(i, V) is ancestor-touching (a-touching) (/', V'). Note that either root{i' ', V') — 
rootii, V) or root{i, V) is an ancestor of root(i' , V). If root(i, V) is a true de- 
scendant of root {V, V) and T(i, V) C T(i', V) then (i, V) is internal to (/', V). 
Otherwise, i.e., if (i, V) and {V , V) overlap, rootii, V) is a true descendant of 
rootii' , V)>t>ut T(i, V) T(i', V') then (i, V) is descendant-touching (d-touching) 

a', v). 

The weight of a multicast was defined such that the following lemma holds. 

Lemma 2 Let S be a set of nonoverlapping submulticasts that are all internal or 
d-touching to a submulticast (i, V) such that S contains at most one submulticast 
for each multicast i' . Then ^ ( -, y , )sS w(i', rootii' ' , V')) < w(i, root(i, V)). 

Proof. Let r — root(i, V) and let (/', V') e 5 be a submulticast with root r' . 
Note that r' is a true descendant of r. Furthermore (/', V) and (i, V) overlap, 
thus max(i,r) and max(i',r') overlap. Hence, r' e R(i',i,r) and w(i',r') < 
max rWeR(( v >(>r) w(i',r"). 
Thus, 

w(i',root(i',V')) < max w(i',r") 

(i',V')eS (i',V')eS y ''' ' 

< 



Emax w{i' , r") 
. . M r"eR(i',i,r) 

w(i, root(i, V)) 



The following lemma is used repeatedly. 

Lemma 3 Let S be a set of non-overlapping submulticasts. 

Then for each submulticast (i, V), Xl(;',y')e5 a-touches (i,v) 

p(i',V',i,V) < 

2p(i, V). 

Proof. Let S' be the set of submulticasts of S that a- touch (i, V). Let r be the root 
and let s be the source of (i, V). Furthermore let e be the edge on the path from 
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s to r that is incident to r. Let (i*, V*) be the submulticast in 5' whose multicast 
tree contains e if such a submulticast exists. 

For each submulticast (/', V) ^ (/*, V*) in 5' let children{V , V) be the 
children v of r such that (v, r) belongs to T(i', V). Then the overlapping profit 
p(i' ', V, i, V) is at most the profit of (i, V) in the subtrees of children{i' , V'). 
Since the submulticasts in 5' are non-overlapping, the set children(i' , V) and 
children(i", V") are disjoint for any pair (/', V), (?'", V") e S' . Thus, 

^] p(i',V',i,V)</70\V). 

(i' > V')e5' > (i',V')#0'*.V*) 

Since p(i* , V*, i, V) < p(i, V), the lemma follows. ■ 

Next we show that the above algorithm gives a constant approximation of the 
optimum result: Let S opt be the set of submulticasts chosen by the optimum al- 
gorithm and let 5/ be the final value of 5. Note that every submulticast in S opt 
overlaps with a submulticast in 5/. We partition S opt as follows: Let S2 be the set 
of submulticasts in S opt that are d-touching or internal to a submulticast of 5/. Let 
Si be the set of submulticasts in S opt that are a-touching to a submulticast of Sf, 
but are not internal or d-touching to any submulticast of 5/. 

Lemma 4 p(Si) < 2ap(S f ) 

Proof. Let (?', V) e S\. Let S'^ be the set of submulticasts of 5/ that are a-touched 
by (/', V')- Note that S' f = 0(i', V) n 5/ and that the residual profit of (/', V) is 
not positive at termination. Thus, 

p(i',V')<a pd',V',i,V). 

(i,V)eS f nO(i',V) 

Thus, by Lemma 3, 

pGSi)= Y, P('".^')<a J2 E p(i',V',i,V) 

(i',V')eSi (i'V')eSi (i,V)eS f nO(i',V) 

= a J2 J2 p(i',V',i,V)<a J2 2p(i,V)<2ap(S f ) 

(i,V)eS f (i',V')eSi a-touches (i,V) (i,V)eS f 

■ 

Next we handle submulticasts in S2. 
Lemma5 p(S 2 ) < (10 + 2a)p (S f ) for a > 2. 
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Proof. The lemma follows from the following claim which we show by induction 
on the number of iterations j: let Sj be the set S after iteration j. Let Dj be 

the subset of S opt consisting of all submulticasts that d-touch or are internal to a 
submulticast in\Jk<jSk- Then 

E P(i',V')<W Pd,V)+a E p(i',V',i,V). 

(i',V')eDj (i,V)eSj (i,V)eSj (i',V')eDj:(i',V) a-touches (i,V) 

The claim holds before iteration 1 since So and Dq are empty. Assume the 
claim holds before iteration j. Let (i, V) be added to S in iteration j and let S de i 
be deleted. Let A = Dj \ Dj-\. Then the left side of the inequality increases by 
J2(i> v r )eA P(*'> ^ We nee d to show that the right side increases by at least so 
much. 

Each (?", V") £ Sj-i is partitioned into two submulticasts (/", V") and (?", V^) 
with (i", V{') e Sj and (i", V'{) e S dd . Note that p(i', V, i", V") < p(i' , V, i", V(')+ 
p(i', V',i", V 2 "). By Lemma 3 

E p(i',V',i",V^)<2p(S del ) 

(i",V^)eS dd (;',V")6A-i:(i',V) a-touches (i",V 2 ") 

Thus, 

J2 E Pd',v,i",v")< 

(i",V")eSj-i (i',r)eD M :(/', V) a-touches (/",V") 

E p(i',V,i",V{') + 2p(S del ). 

(i",V[')eSj (/',V')eA-i:(i'.V') a-touches (i" ,V'{) 

Thus, the total decrease of the right side by removing S de i from S is at most 
(10 + 2a) p(Sd e i)- It follows that the right side increases by at least 

l0p(i,V)+a E p(i',V',i",V")-(lO+2a)p(S del ). 

(i",V")eSj ((',V')eA:(;',V') a-touches (i",V") 

We know that p(i, V) > ap (S de i), which implies that 7 p(i, V) > (10+2a) p{S de i) 
for a > 2. 

We show below that 

E p(i',V')<3p(i,V)+a E p(i',V',i",V"). (*) 

(i',V')eA (i",V")eS/ (i'.VOeAKi'.V) a-touches (i",V") 

Thus, the inductive claim continues to hold. 

To show (*) we consider two cases. Let A = A n S de i. Let 5 be the rest of A. 
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Bounding A: As shown above, ap (Sdei) < pii, V). Thus, piA) < /?(?', V)/a < 
PU,V). 

Bounding B: Consider a multicasts (/', V) in B. When (i, V) is added, (/', V') 
does not belong to Sj-i and it is not selected by the algorithm. Thus, the ratio 
r(i', V) is at most the ratio r(i, V). Note that each multicast has at most one 
submulticast in B. Thus, using Lemma 2 it follows that 

Presii',V')= r(i',V')w(i',root(i',V'))< 

(i',V')eB (i',V')eB 

r(i, V) wU',rootii', V)) < r(i, V)w(i,root(i, V)) - p res (j, V). 

(i',V')eB 

Thus, 

J2 {pii',V')-a J2 pii',V',i",V"))<p(i,V)-apiSdei). 

(i',V')eB (i",V")eSj-inO(i',V) 

By definition no submulticast (/', V) in B d-touches or is internal to a submul- 
ticast in Sj-\. Thus, all submulticasts in Sj-\ n 0(i' , V) are a-touched by (?', V'). 
Using Lemma 3 for the second inequality in the same way as above shows that 

piB) < pii,V)-apiS d ei) + oc E pii',V',i",V") 

(i",V")eSj-i (i',V')eB:(i',V) a-touches ((",V") 

< pii,V)+ap{S d ei) + a E pii',V',i",Vl') 

(i",V[')eSj (i',V')eB:(i',V) a-touches (i",V[') 

< 2p(i,V) + a J2 E p(i',V',i",V{'). 

(i",V'{)eSj (i',V')eB:(i',V) a-touches (i",V{') 

Since p(A) = p(A) + piB), this shows (*). ■ 
It follows that 

piSopt) = piSi) + piS 2 ) < (4a + 10)/7(S), 
for a > 2. Choosing a — 2 gives an approximation factor of 18. 

2.2 The polynomial time implementation of the algorithm 

We are left with showing how to implement each iteration in polynomial time. 
Given a set S, the algorithm must compute at each step a submulticast of maximum 
ratio r(i, V). Note that it suffices to compute for each multicast i and each possible 
root position r* the submulticast best(i, r*) with maximum residual profit. The 
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desired submulticast is the one that maximizes over all multicasts i and all possible 
root positions r* of i the ratio 

p res (best(i,r*)) 
w(i, r*). 

Let T v be the subtree of T rooted at node v. 

We describe a polynomial time procedure based on dynamic programming that 
finds for each multicast i and for each possible root position r* a submulticast 
bestii, r*) with maximum residual profit. To be precise it suffices to consider all 
submulticasts of max(i, r*). 

Let (i, V) be the submulticast of i that currently belongs to S (V — 0 if no 
such submulticast exists.) To find best(i, r*) root the tree T at the source s of 
max(i, r*). (This rooting is completely independent of the rooting in the previous 
section.) We first compute a cost cost{e) for each edge in the tree. Then we 
construct a binary tree T from the original tree for use in a dynamic program. 
Next, we use the edge costs to compute two cost functions cost® and cost\ for 
each vertex v € T using bottom-up dynamic programming. Finally, cost} can be 
used to determine best(i, r*). 

Computing the edge costs: Use a depth-first traversal of the tree starting at s. 
When traversing edge e, a cost is assigned to e. The cost of edge e will be the 
profit that is lost by S when this edge is assigned to multicast i and thus is no 
longer available for submulticasts in S, ignoring the profit that 5 already lost on 
the edges along the path from e to s. 

Formally, we have the following definition: If e belongs to (i, V) e S, or e does 
not belong to any submulticast in 5, or e does not belong to the multicast tree of 
max(i, r*), cost(e) is 0. Otherwise, e belongs to the multicast tree of max(i, r*) 
and to a submulticast (/', V') of S with i' ^ i. Let e = (u,v) and let u be the 
parent of v. Let p(V , V, s, x) be the number of requests of (/', V) that use an 
edge on the path from s to x. Then cost(e) — p(i', V', s, v) — p(i', V, s, u). 

Obviously all edge costs can be computed in polynomial time. 

To illustrate the edge costs, assume for a moment that S contains only 1 sub- 
multicast (?', V'). Let s' be the source of V . Then the only edge costs that have 
non-zero value are the ones on the path from s' to s or incident to a node on the path 
from s' to s. Call these edge costs the edge costs for (/', V'). If 5 consists of more 
than one submulticast (ij, Vj), then the edge costs are simply assigned considering 
separately each (ij, Vj). Thus, a multicast i' ^ i of S with an edge of positive cost 
incident to v has its source in the subtree rooted at v. 

Constructing the binary tree T : For the bottom-up dynamic programming ap- 
proach we need to transform the tree into a binary tree T by introducing additional 
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nodes and edges. The cost of each additional edge is 0. Let v be a node in the orig- 
inal tree with d > 1 children. We call an edge incident to v unused if it does not 
belong to a submulticast of S or has cost{e) — 0. All the other edges incident 
to v have positive cost and belong to a submulticast of S. Recall that a multicast 
i' ^ i of S with an edge of positive cost incident to v has its source in the subtree 
rooted at v. Let q be the number of different multicasts with a edge of positive cost 
incident to vertex v. 

We replace v by a binary tree with q + 1 leaves, if there are unused edges inci- 
dent to v, and with q leaves otherwise. The first q are called special and correspond 
to a submulticast in 5 incident to v with at least an edge of positive cost; the last 
leaf L, if applicable, corresponds to all unused edges incident to v. Leaf L is the 
root of a binary tree with I leaves, where I is the number of unused edges incident 
to v. Each unused edge incident to v in T is incident to one of these leaves such 
that each leaf is incident to one of the unused edges. 

Let v(i') be the special node corresponding to submulticast (/', V') € S and 
assume t edges of (/', V) are incident to v. Then v(i') has two children, one 
corresponding to the edge e connecting v to the source of V and the other being the 
root of a binary tree with t — 1 leaves. The subtree for v(i') in T consists of v(i'), 
its children and the binary tree of t — 1 leaves. Edge e is connected to "its'" leaf 
and each other edge of (/', V') incident to v is incident to one of the other leaves 
such that each leaf is incident to one of these edges. Note that these nodes have 
one child each in T', i.e., are not leaves in T '. 

The root of the binary tree for v is labeled v. A leaf of the binary tree corre- 
sponding to a leaf v of T is also labeled v. 

All submulticasts are extended in a natural way to T '. 

Computing the cost functions cost 0 and cost 1 : We want to compute a submul- 
ticast of max(i, r*) with maximum residual profit. Thus, we find for each possible 
profit b,b = 1, 2, the submulticast (i, V*) that gets b profit and maximizes 

b — acost(b), where cost(b) is the minimum profit that S "looses" if (i, V*) is ac- 
cepted. This is equivalent to finding the submulticast with profit b that minimizes 
cost{b). The basic idea is to use bottom-up dynamic programming to compute for 
each node v' in T' the submulticast that minimizes cost v >(b), where cost(b) v ' is 
the minimum cost that 5 "looses" in the subtree rooted at v' if the submulticast is 
accepted. 

However, there is a complication at special nodes that requires the use of two 
cost functions, cost 0 , and cost*,: Let w be a node of T with 2 children y and z, both 
belonging to submulticast (/', V) € S. Let y lie on the path from v to the source 
of V . Let v' be the special node v{i') in the subtree of w in T and let V and r' be 
its children such that contains the node labeled y. Assume profit 0 < j < b is 
achieved in T y and profit b — j is achieved in T z . Then we do not want to add both 
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cost(w, y) and cost(w, z) to cost v >, since cost(w, y) considers already the cost of 
"loosing" all the requests of (/', V') whose path uses w. Instead we want to only 
add cost(w, y) and not cost(w, z) to cost v >. Thus, to "tell" the recursion on r' 
that it should not add in any more "lost profit" of (/', V) we use the cost function 
cost®,. If the recursion on r' is allowed to add in more "lost profit" of (/', V) (since 
e.g. no profit was achieved in T y ), we use cost).,. Note that this complication arises 
only at special nodes. 

We need to use cost 0 in T' r , until we reach the leaf x' of the subtree of v{i') in 
T whose edge e to its child corresponds to (w, z). When computing cost 0 , we do 
not add in the cost cost (w, z) of e and then use cost 1 to recurse on the children of 
x' . Thus, cost 0 , is only needed at nodes belonging to the subtree of v{i') in T for 
some submulticast (/', V) € S. 

We next give the formal definitions. When we want to avoid that the dynamic 
program chooses a specific cost',(b), t — 0, 1, combination, we set its value to oo. 

(1) For all v' e T and for t = 0, 1: cost' v ,(0) = 0. 

(2) For all v' e T such that there are less than b requests of max(i, r*) in the 
subtree of v in T and for t — 0, 1: cost' v ,(b) — oo. 

(3) For any b > 0, for t — 0, 1, and for all v' e T with one child, call the 
missing child r' . We assume that cost' r , (b) — oo. 

(4) For all non-special v' e T', let c be the number of requests of max (7, r*) at 
v' . Then, 

cost),(b) — mm{mmy = i .j,- c -i}Cost(v', I') + costj,{j) + cost{v' , r') 

+cost),(b - c - j), 
cost(v' ', V) + cost\,{b — c), cost(v', r') + cost),{b — c)} 

For all special v' — v{i') e T , let /' be the child of v that leads to the source 
of multicast Then 

cost v ,(b) = min{min{j = i .. ^ijcosf (V, /') + cost v {j) + cost°,(b — j), 
cost (v', I') + cost), (b), cost (v' ,r') + cost},(b)} 

(5) For all v' e T whose children are not labeled by a vertex of T, let c be the 
number of requests of max (7, r*) at v' . Then 

cost°,(b) = mm {j= o,..,b-c}(cost 0 (j)+cost°,(b-c- j)) 

For all v' e T whose children are labeled by a vertex of T 

cost 0 , (b) = min (i - = o,..,/,} (cost), (j) + cost), (b - j)) 

In (5) we are not adding in cost(v' ', V) and cost(v', r'). 
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Computing cost' v ,{b) for a given v' and b by bottom-up dynamic programming 
takes time 0(n). Since there are n different values for b and 0(n) nodes in the 
tree, we spend time 0(n 3 ) for multicast i and possible root position r* to compute 
all cost 0 and cost 1 values. 

Lemma 6 Let s' be the node of T labeled with the source s of i. The largest 
residual profit of any submulticast ofmax(i, r*) is ma.Xb(b — acost],(b)). 

Proof. Let T], be the subtree of T rooted at v' . We prove the following claims by 
bottom-up induction: 

(1) For each vertex v' € T cost],(b) equals 

• the minimum number of requests in T', of submulticasts in S that cannot be 
accepted if a submulticast of max(i, r*) accepts b submulticasts in T' v ,, 

• and oo otherwise. 

(2) For each vertex v' e T that belongs to the subtree of v{i') for some 
(i', V) e S,cost°,(b) equals 

• the minimum number of requests in T' v , of submulticasts in S \ (/', V') that 
cannot be accepted if a submulticast of max(i, r*) accepts b submulticasts 
in K, 

• and oo otherwise. 

The second part of each claim follows by the definition of cost], and cost 0 , we 
only need to show the first part. 

(1) For the basis of the induction, if v' is a leaf of T it is labeled with a leaf 
v of T. We can assume that b requests of max(i, r*) are at v' . Note that any sub- 
multicast (/', V') of S with a request at v' also must have its source at v' . Thus, the 
request of {V ', V) can be accepted, even if b requests of max(i, r*) are accepted. 
Hence, cost^ib) = 0, and the claim holds. 

If v' is an internal vertex of T , we distinguish two cases, depending on whether 
v' is a special node or not. Let /' and r' be the two children of v' . 

Case 1: v' of T is not a special node. If v' is not a special node of T', then 
cost I, (b) is defined by the first part of rule (4). 

If only one edge (i/, /') is incident to v' then cost^Qj) = cost(v', l')+costf,(b— 
c) and the claim holds by induction on vertex Otherwise, cost^,(b) is defined as 
the minimum over all the possible partitions of b — c between the two subtrees Tj, 
and T' r ,. Since inductively cost], and cost], are equal to the minimum profit that S 
looses in T], and T],, the claim holds also for cost],. 
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Case 2: v' of T is a special node. If v' — v(V) is a special node of T' , then 
costl,{b) is denned by the second part of rule (4). Let (/', V) € S. 

The minimum number of lost requests is the minimum over all the possible 
partitions of b between the subtree T[, that contains the edge leading to the source 
of i' and the subtree T' r , containing the other edges of T(i', V'). Let w be the node 
in T to whose binary tree n T v' belongs and let y be the child of w such that T y 
contains the source of V . Note that in T the edge e from I to its only child in T 
has cost cost(w, y). 

If the submulticast of max(i, r*) accepts requests in T{, then the submulticast 
contains e. By the definition of cost{w, y), e's cost cost{w, y) contains already 
all the profit of (/', V') lost in T' r ,. Thus, no further lost profit in T' r , should not be 
added to cost],. By the inductive claim, cost®, computes the minimum profit lost 
by S in T' r ,, not considering the profit lost by the submulticast to which edge (1/ , r') 
belongs, i.e., the profit of (/', V). 

Since cost], (b) is again defined as the minimum over all the possible partitions 
of b — c between the two subtrees, using cost® for T], and cost®, for T' r ,. Thus, the 
claim follows as in case 1 . 

If, however, the submulticast of max(i, r*) does not accept requests in T' v , then 
the lost profit equals cost(v', r') + cost]., (b — c) and the claim holds by induction 
on r'. 

(2) The edges in T with non-zero cost are edges that connect nodes labeled 
by a vertex of T to their parents. Let e — (x', y') be such an edge belonging to 
submulticast (/', V) € S. Note that the higher endpoint x' of e belongs to the 
subtree of v(i') in T and that all children of x\ including y' , are labeled with a 
node in T. Then the cost of no further edge in Ty is related to the profit of (/', V), 
i.e., if (/', V) were the only submulticast in 5, all edges in T y , would have zero 
cost. 

To prove the claim for cost®, (b) we distinguish again two cases. 

Case 1: v"s children are labeled by vertices of T. Note that v' is not labeled 
and thus, there is no request of max(i, r*) at v' . By the above observation, no edges 
in the subtree of i/'s children have costs related to the profit of (/', V'). Using 
induction on /' and r', the minimum number of requests in T', of submulticasts 
in S \ (i', V') that cannot be accepted if a submulticast of max(i,r*) accepts b 
submulticasts in T' v , is given by the minimum over all 0 < j < b of costj,(j) + 
cost\,{b — j). Thus, the claim holds. 

Case 2: v"s children are not labeled by vertices of T. Let (/', V) be the 
submulticast such that v' belongs to the subtree of v(i'). Note that the edges from 
v' to its children have cost 0 and that v' is an internal node of the subtree of v(i'). 
Thus, the children /' and r' of v' also belong to the subtree of v(i'). Inductively their 
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cost 0 equals the minimum number of requests in T{, resp. T' r ,, of submulticasts 
in 5 \ (/', V) that cannot be accepted if a submulticast of max(i, r*) accepts b 
submulticasts in T[, resp. T' r ,. Hence the minimum for T', is given by the minimum 
over all 0 < j < b — c of cost} (j) + cost 0 (b — c — j). Thus, the claim holds. ■ 

Determining best(i, r*): By Lemma 6 the largest residual profit of a submul- 
ticast of max(i,r*) is found by determining the value b\, est that maximizes the 
function b — acost\{\)). The submulticast best(i, r*) is found by reconstructing 
which edges were used for the computation of cost] (bbest )- 

3 The online algorithm for trees 

We describe an online algorithm ST for unit-capacity trees. For sake of simplicity 
we assume that the sources as well as the members of the multicasts are leaves 
of the tree. The general case can be easily reduced to this setting. The algorithm 
consists of two stages. The first stage is a randomized procedure that selects a 
subset of requests that will form the set of "candidate" requests C for the second 
stage. The second stage decides for each request in C whether to accept or reject 
it. The requests accepted by the second stage are the requests accepted by ST. 

The first stage of the algorithm. 

The first stage runs the multicast algorithm, called MC, of [12] on a tree with 
capacity u, where u — log/x and /x — 4m 6 M. Then stage 1 adds the accepted 
requests to C. 

On a graph where each link has capacity u, MC achieves a competitive ration 
of 0((log n + log log A / J)(log n + M) log n). However, when applied to trees and 
compared to an offline algorithm with link capacity 1, MC's competitive ratio 
is 0(log« + logM): A first 0(log«) factor is saved since in a tree both the 
online algorithm and the offline algorithm connect the requests accepted by both 
algorithms to the root through the same multicast tree. (In particular, in this setting 
the claim of Lemma 5.1 of [12] can be proved without the Oilogn) factor). An 
additional O (log /x) factor is saved since the online algorithm has O (log /x) more 
capacity on the edges. This is proved in the same way as for unicast (see [16] and 
[15]). (In particular in [12] a factor of 0(logn + log log M) is saved in Claim 
5.7). Thus, MC, when applied to a tree network with 0(log /x) more capacity on 
the edges is 0(log« + log M) -competitive. 

The second stage of the algorithm. 

In the second stage all the vertices of the tree are partitioned into 0(log n) different 
classes, by recursively finding a balanced tree separator. A balanced tree separator 
[22] is a vertex whose removal splits the tree into pieces of at most |?i vertices. 
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The tree separator of T is assigned level 0. Removing the level-0 node splits T into 
subtrees oflevel-1. In general, the tree separators of the level- j trees are assigned 
level j and removing them creates subtrees of level j + 1. After a logarithmic 
number of recursions the trees obtained are single vertices and the procedure stops. 
A similar technique is also used in [5] for the online call-control problem on trees. 

Each of the requests in C is assigned to one of 0(\ogn) classes as follows. A 
request from vertex v to multicast source s is assigned to class j if the vertex of 
lowest level on the path from v to s has level j. 

One of the O(logTi) levels is chosen at random by the algorithm before to 
process the sequence of requests. We denote with i the level selected. 

Stage 2 decides to accept or reject a request in C using the following algorithm: 

1. If the request is not of level i then reject it and stop. 

2. If the request is the first one of multicast i seen at this step, then: 

• Flip a coin with success probability -. 

• If success then pass to step 3 the current and all the future requests to i 
seen at this step; otherwise reject all the future requests to i seen at this 
step and stop. 

3. Accept a request from vertex v to source s if no edge on the path from v to s 
is assigned to other multicasts; otherwise reject. 

The following lemma bounds the expected number of requests accepted by ST. 

Lemma 7 The algorithm ST expects to accept an 0 ^ og l niog ^ fraction of the re- 
quests accepted by MC. 

Proof. The expected number of requests passed from step 1 to step 2 of stage 2 is 
a 0 (io gW ) fraction of the requests accepted by MC, since the level of each request 
is chosen uniformly at random with probability Q ^ gn) . The expected number of 

requests passed from step 2 to step 3 is a fraction - of all the requests received 
from step 1 . This follows since all the requests for a multicast are passed to step 3 
with probability -. 

We are left to prove that each request received at step 3 is accepted with con- 
stant probability. Consider a pair of requests for different multicasts. If they in- 
tersect, the intersection is on an edge adjacent to a level i vertex. Each request is 
connected to the source through at most two edges adjacent to a level i vertex. The 
probability that a request is accepted is then given by the probability that these 2 
edges are not assigned to other multicasts. 
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Any edge of the tree is part of at most u multicasts given the maximum capacity 
of the edges of the tree for the online algorithm in the MC solution. The edge is 
assigned to each of these multicasts with probability - , if the algorithm decides to 
accept requests from this multicast. When a request arrives at step 3, the probability 
that an edge adjacent to the level i vertex on the path to the source has not been 
assigned to a different multicast is then lower bounded by (1 — -)" > l/e. The 
probability that both edges adjacent to a level i vertex have not been assigned is 
then lower bounded by 1 /2e, thus proving the claim. ■ 

This leads to the following theorem: 

Theorem 8 There exists an O ((log n + M) (log n + log log M) log n)-competitive 
algorithm for multicast routing on unit-capacity trees. 

A randomized lower bound of Q (log n log M) follows from [3] since the mul- 
ticast routing problem on trees of unit capacity contains the online set cover prob- 
lem. 

4 The offline algorithm for a mesh 

We present an O ( (log log n) 2 ) -factor approximation algorithm on meshes. The 
algorithm partitions the mesh into squares of logarithmic size and divides every 
square into an external and an internal region. The external region of a square 
is reserved to route requests into, out of, and through the square. It is called the 
crossbar structure of the mesh. To avoid edge-overlapping we discard all requests 
whose request node or source belongs to an external region. From the remaining 
requests the algorithm considers with equal probability either only short requests 
directed from a request vertex to a source in the same square, or only long requests 
directed from a request vertex to a source in a different square. A randomized 
rounding technique based on a novel formulation of the multicast routing problem 
as an integer linear program is then used in conjunction with the use of a simulated 
network with edges of higher capacity. 

Let G denote the n x m two dimensional mesh such that m — @(n). Wlog 
m > n. We assume n sufficiently large such that [log log log n J > 3. Define 
B = 4 [log 7i J , f(k) — k div 9B, and f\ (k) = k mod 9B. Given two integer values 
a and b an (a, b, B)-partitioning of the mesh G is a partitioning into f (n) x f(m) 
submeshes of 0(B) size induced by segmenting the horizontal and the vertical 
side of the mesh. The horizontal side is partitioned into a segment from column 
1 to column a, followed by f(m) — f\(m) contiguous segments of size 9B, by 
f\(m) — 1 segments of size 96 + 1 and by a last segment of size 9B + 1 — a. 
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The vertical side of the mesh is partitioned in a similar way with b used in place 
of a and n instead of m. By abuse of notation every resulting submesh is called a 
square, even though the size of the two sides of a square may differ. Note that each 
node belongs to exactly one square while an edge can be incident to nodes of two 
different squares. We denote the square containing a node t by S t . 

The border of G is formed by all nodes of degree less than 4. The 1st ring in a 
square 5 consists of all nodes of S that are incident to a node outside of S or belong 
to the border of G. Recursively, the i-th ring of S with i > 1 consists of all nodes 
of S that are incident to a node of ring i-l of S. The innermost ring of a square 
is either a single vertex or a line of nodes. A ring that is not the innermost ring 
either forms a rectangle (if its square does not contain nodes of the border of G) 
or forms a rectangle with one or two borders of G. In any square S we define two 
regions R l s and R 2 S . Region R l s consists of rings from 1 to B, region R 2 , contains 
all remaining rings of 5. Ring B + 1 is the border of /?|. 

Let A be the sequence of requests. The algorithm chooses two integer values 
a and b uniformly at random in the interval 2B + 1, IB and constructs an 
(a, b, B) -partitioning. Then it discards all requests (t, s) such that either t or s 
does not belong to the R 2 region of its square. The set of remaining requests 
is denoted by C. The following lemma implies that for any input sequence A, 
E[\OPT(C)\] > \OPT(A)\/25 since for every request (t, s) the probability that 
t and s both belong to R 2 is at least 1/25. 

Lemma 9 Given two nodes t and s they both belong to region R 2 of their squares 
with probability at least 1/25. 

Proof. We prove separately that the x and the y coordinates of t and s are within 
region R 2 with constant probability. Consider interval / = [2B + 1, IB] from 
which a is chosen uniformly at random. Since region R l has width 2B, x t falls 
within region R 2 if a is chosen out of a subset I s of / of size at least 3B. Analo- 
gously, x s falls within region R 2 if a is chosen out of a subset I t of / of size at least 
3B. Since / has size 5B, for at least B out of 5B possible values of a, i.e. with 
probability at least 1/5, both x t and x s fall with region R 2 . By symmetry, y t and y s 
fall within region R 2 with probability at least 1/5. It follows that with probability 
at least 1/25 every request has both endpoints in region R 2 . ■ 

By the choice of a and b, at least 2B rings are contained in a square. Thus, 
region R x s is always complete, while region 7?| is formed by at least B rings. 

The set of requests C is partitioned into the set of long requests £ = {(t, s) e 
C : S t ^ 5,, } and the set of short requests S — {(t, s) e C : S t — S s }. For i e M, 
denote by — [t : (t, s,-) e C] the set of request nodes of multicast i. 
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The algorithm decides with equal probability to accept either only long requests 
or only short requests. We describe next the algorithm specialized for long requests 
then the algorithm specialized for short requests. 

4.1 Long requests 

Our approach is to transform the problem into a problem on a network G' , then 
formalize the problem on G' as IP, relax it to an LP, solve the LP, and round the 
LP solution probabilistically. Finally we use the rounded solution to construct a 
solution in G. 

Mesh G is transformed into a network G' — (V, E') as follows. For every 
square 5 of G, network G' contains vertex xs- The vertices xs and xy of two 
adjacent squares 5 and 5' are connected by an edge of capacity [log n\ . For every 
square S of G, every vertex u of region 7?| has a corresponding vertex u' in G' . 
For any pair of adjacent vertices u, v in /?|, vertices u' , v' in G' are linked with an 
edge of unit capacity. Every vertex of the border of R 2 S is connected to x$ by an 
edge of unit capacity. For every multicast i and for every request vertex u e C, • , a 
vertex u • is connected to vertex u' with a unit-capacity edge. The input sequence 
for the multicast routing problem on G' is created by transforming every request 
(t, s) e Ci into a request {t[, in G' . 

The next step is to formulate the multicast routing problem in G' as a packing 
problem: For every multicast i consider the set % consisting of all trees containing 
Si and a non-empty subset of the request nodes t[. Since we introduced the nodes 
t'^TiDTj = 0 for i ^ j. Let T = U ieM Ti. Denote by V(T) the set of vertices 
of tree T and by E(T) the set of edges. Let the benefit of tree T e % be b(T) — 
\{t( e V(T) : t e 

We associate a variable xj e {0, 1} with every tree T e T. Edges of E' are 
subject to constraints: 

x t< c(e), ye e E'; (1) 

TeT:eeE(T) 

x T < 1, Ve e E',Vi e M. (2) 

TeTr.eeE(T) 

The multicast routing problem consists in maximizing the following objective 
function: 

TeT 

The fractional packing problem is obtained replacing the integrality constraints 
on variables xt with constraints xt > 0. We also drop edge constraints (2) to ob- 
tain a linear program where every edge is involved in a single constraint and solve 
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it using the polynomial time e -approximation algorithm of Garg and Konemann 
[10] based on duality. The algorithm assigns a dual variable y(e) to every edge 
e e G' . The central step of the algorithm requires to find the variable xj with 
maximum ratio opt = b(T)/ Y2 eeT y(e). This problem is NP-hard, since it cor- 
responds to finding the densest tree in the network G' where edges are weighted 
with the values of the dual variables. However it is easily checked that if we find a 
variable Xf with b(T)/ Xl ee f y( e ) — opt /a then the algorithm of [10] also gives 
a 1 + e -factor approximation of the fractional multicast problem on G' . 

As was previously observed by [2] a fc-MST algorithm can be used to solve 
the densest tree problem. The 3-approximate fc-MST algorithm of Garg [9] can be 
adapted to work in the case the k vertices are restricted to be request vertices of 
the same multicast. Thus, for every multicast i and every k — 1, the 3- 

approximate fc-MST algorithm is applied. It finds the tree T{ (k) spanning k request 
vertices of such that ^ e67 -. (yt) v(e) < 3opttj, where opt^j = mm{^2 eeT y(e),b{T) — 
k,T e %}. Then the tree of maximum ratio k/ ^2 eeT .^ y(e) over all k and all i 
is selected. Since this ratio has value at least opt/3, this results in an 1 + e-factor 
approximation algorithm for the fractional multicast problem. 

Denote by x T the solution of the fractional multicast routing problem 1 . 

Let s = l/((cloglog7i) 2 ), where c > e is an appropriate constant to be fixed 
later. The algorithm rounds variable xj to xj — 1 with probability sx T , and to 
xt — 0 with probability 1 — sx T . Let G; be the graph with edges E{Gi) = 
[ ~ > TeT i -Jc T =iE(T). For any multicast i the algorithm selects an arbitrary spanning 
tree T, of graph G ; . 

The trees T,- do not form the integral solution since there might be violated edge 
capacities for the unit-capacity edges. However, as described below, the requests 
accepted by the final solution form a subset of the requests accepted by the trees T i 
and the size of the subset is a constant fraction of the requests accepted by the trees 
of Ti . To prove the approximation bound we show in the appendix that the value 
S of the optimal solution of the fractional packing formulation is within a constant 
factor of the optimal integral solution on the set of requests £ (Lemma 17), and that 
the expected number of request nodes contained in the trees T,- is within a factor 
of 0((loglogn) 2 ) of the value S (Lemma 18). 

Let C\ — Li n V(Tj) be the set of request vertices to multicast i that are 
spanned by tree T,- if no edge (x$, xy) of G' is violated, C\ = 0 otherwise, and let 
C 1 = XlieX A • W e prove that with at least constant probability no edge (xs, xy) 
of G' is violated , i.e., C l ^ 0 (Lemma 19). If C 1 = 0, the algorithm terminates 
without accepting any multicast. Otherwise, each request of accepted by the 

'Let for some (', {r (1 \ . . . , r (;) ) be the set of all the trees of % with — 1, for all 1 < I < j . 
Then T ([} U . . . U T (j} forms the multicast tree for multicast 
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final solution is routed along a path containing the same edges (xs, xy) as its path 
to the source in T, . The remaining problem is to route request nodes and sources 
to the border of their square. This problem was called the escape problem. The 
solution proposed by Kleinberg and Tardos for unicast routing [14] uses the fact 
that the benefit collected in a square is of the same order as the maximum flow 
that can be routed through the border of the square. This is not true for multicast 
routing: the maximum benefit that can be collected in a square is 0(log n), while 
the maximum flow that can be routed through the border of the square is 0(log n). 
Thus, using the same maximum flow approach as in [14], which means routing 
request nodes individually out of the square, leads to an O (log n) -factor approxi- 
mation. We give instead a recursive approach that achieves a 0((log log «) 2 )-factor 
approximation. 

Our basic idea is to recursively partition every region R 2 S into subsquares of size 
0(loglog7i), and each subsquare Q into a subregions R Q and R 2 q. Requests are 
routed to the border of R 2 q on the same path as in the trees T, and from there they 
use rings of the R 1 regions of subsquares to reach the border of /?|. The sequence 
of subsquares used for a request is the same as on the path in T,. Therefore we 
enforce that the trees r ; are edge-disjoint within the R 2 Q regions and that there are 
at most 0(log log n) trees connecting between any two neighboring subsquares. 

We next give the details: A gate vertex for multicast i in square 5 is a vertex 
q on the border of R 2 S such that (q, xs) belongs to T{. Let g{p) be the gate vertex 
closest to node p e Li on the path from p to in J, closest to p. Let g(s{) be a 
gate vertex closest to 57 on a path from s\ to a node outside S s . in T,-. The escape 
problem is the problem to connect each request node p to g(p) and to connect each 
source to s to at least one g(si). Let S be a square whose region 7?| consists of a 
k\ x li2 mesh. Let k — min(^i,^2) and let Bs — 4[log£J. Note that k > B. 
The algorithm uniformly chooses two integer values as and bs from the interval 
IBs + !>•••! 7Bs for each square S and creates an (as, bs, Bs) -partitioning for the 
region /?|. Each submesh Q created by this partitioning is called a subsquare. If Q 
does not contain nodes of the border of R 2 S , region Rq of subsquare Q consists of 
rings 1 to Bs, region Rq consists of the remaining part of Q. If Q contains nodes 
of the border of R s , we need a different definition: Let Q be a £3 x £4 mesh with 
£3, £4 < 9Bs- Assume Q is extended into a 9Bs x 9Bs mesh Q' by nodes outside 
of Rg. Regions Rq, and Rq, are defined as above. Region Rq is then R 1 q, Pi R 2 , 
and region Rq is Rq, n /?|. By the choice of as and bs and the definition of Rq 
there are gate vertices in S that belong to Rq if subsquare Q lies on the border of 

(1) The algorithm rejects all requests whose source or request node belongs 
to the region R l 0 of their subsquare Q. The remaining set of requests is called 
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C 2 . A subsquare is called invalid if one of the edges of G' incident to a node in 
the subsquare belongs to more than one tree T,-. Since every edge is assigned to 
a tree with probability 0(l/((loglog«) 2 ), a subsquare in not invalid with at least 
constant probability (Lemma 23). (2) Every request node belonging to an invalid 
subsquare is discarded and every multicast whose source belongs to an invalid 
subsquare is discarded. The set of remaining requests is called C? . A square 5 is 
called invalid if there exists a pair of neighboring subsquares Q and Q' of S such 
that more than B$/4 trees T, contain an edge incident to Q and Q' . Every square 
is proved to be not invalid with at least constant probability (Lemma 24). (3) Every 
request node belonging to an invalid square is discarded and every multicast whose 
source belongs to an invalid subsquare is discarded. The set of remaining requests 
is called C 4 . (4) All request nodes p in C 4 such that g(p) belongs to Rq for some 
subsquare Q are discarded and multicast i is discarded if all gate vertices of square 
5 containing source belong to R$. The set of remaining requests is called C 5 . 
(5) Finally all requests p of C 5 such that g(p) belongs to an invalid subsquare 
are discarded, and multicast i is discarded if in a square S containing source Sj 
all gate vertices in S connected to 57 in Tj belong to invalid subsquares. The set 
of remaining requests, called C 6 , is accepted. Set C 6 is expected to be at least a 
constant fraction of set C 4 (Lemma 25 and 26). 

4.1.1 Routing Algorithm 

We next show how to route the accepted requests, i.e., how to construct the multi- 
cast tree T, for multicast i . The basic idea is to route using the same squares and 
subsquares as the tree T, but to route "through" a square S in R l s and "through" a 
subsquare Q in Rq. 

Let t\ be the subtree of J spanning 57 and the vertices in J \ D £ 6 , the request 
vertices accepted for multicast i. We first explain how to connect gate vertices. For 
every vertex x$ of T i the algorithm assigns a ring r$j of region R^ to multicast 

i. For every edge (xs, xs>) of t{ let r be the ring of [rsj, rs>j} with larger index. 
The algorithm assigns to multicast i the straightline extension of r from one corner 
of r to the other ring, thereby connecting r$j and r$'j. For a vertex xs of t( and 
every gate vertex q of multicast i belonging to the horizontal (vertical) side of the 
border of /?| and to a valid subsquare of S, the algorithm assigns to multicast i the 
vertical (horizontal) straightline path from q to rsj. Note that all gate vertices of 
multicast i in S are connected to the same ring rsj. 

Next we describe how to connect a request node p of multicast i in S to its 
gate vertex q = g(p). Let Q be the subsquare of p and Q' be the subsquare of q. 
Recall that g{p) lies in region R 2 0 . Let P be the path between p and q in T, and 
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let e p be the edge on P closest to p and incident to exactly one node of R 2 q. Let e q 
be denned symmetrically. 

— / 9 

For every subsquare Q such that (i) T i contains a vertex of Rq and (ii) 57 or a 
request node of J belongs to S we assign a ring tq i of Rq to multicast i, where 
S is the square containing Q. 

If Q — Q! an d P is connected to q in T • by a path in Rq then p is connected 

to g in the same way in 7). If 2 — 2' and p is connected to g in t{ by a path 
P with edges outside 7?g, we connect both p and g to rg , in 7}. Request vertex 
p is connected to tqj in 7} by the part of P from p to e p and then the straightline 
extension from e p to rg,-. Gate node g is connected to rg; in 7} in an analogue 
way. 

If Q 7^ Q' 5 then /? is connected to g by the same path as above to rg,-, then the 
path P' described below to rg',,-, and finally the same path as above from rg' ,- to q. 
To construct path P' the above algorithm to construct a path between gate vertices 
is applied: for each neighboring pair (Q\, Q2) of subsquares on P a horizontal 
or vertical connection between rings rg M - and rg 2>( - is assigned to the multicast. 
Path P' is a simple path between rg,- and rg^- created from the assigned rings and 
connections. 

Next we describe how to connect the source of i to one of the gate vertices q 
of i in 5. This suffices since all gate vertices of i in S are connected to the same 
ring rsj. Recall that one of the gate vertices, say q, belongs to /?|. The algorithm 
connects s to q in the same way as it connected a request node p to q. 

4.1.2 Proof of Correctness 

We show that the resulting multicast trees 7} are edge-disjoint. The multicast tree 

for multicast i is contained in the rings r$j of squares and rings rgj of subsquares 

assigned to i, the assigned horizontal or vertical connections between assigned 

— f 9 

rings and from gate vertices or edges e p to rings, and paths of T i in regions Rq of 
subsquares. 

We distinguish the following seven types of paths used by the multi casts and 

show below that (1) for each square 5 the algorithm can assign a unique ring r$j if 
— f 

x$ G T ) (Claim 10); (2) for each subsquare Q the algorithm can assign a unique 
ring rQj if T i contains a vertex of R 2 q and Si or a request node of belongs to the 

square of Q (Claim 1 1); (3) for every edge (x$, xy) in t{ the algorithm can assign 
a unique horizontal or vertical connection between r$j and rs'j (Claim 12); (4) 
for each gate vertex q of multicast i in square S the algorithm can assign a unique 
horizontal or vertical connection between q and r$j if q belongs to a valid sub- 
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square (Claim 13); (5) for every neighboring pair (Qi, Qi) of subsquares and for 
each multicast i the algorithm can assign a unique horizontal or vertical connection 
between rg 1>( - and rg 2> ; if they both exist, (Claim 14); (6) if p is either a request 

vertex in T i or a gate vertex such that a request node of T t in S p connects through 
p to si , then the algorithm can assign a unique horizontal or vertical connection 
between e p to tqj, where Q is the subsquare containing p (Claim 15); (7) if p 

— / — / 

is either a request vertex in T i or a gate vertex such that a request node of T { in 

S p connects through p to 57, then the algorithm can uniquely assign the path in T i 

from p to e p or to g(p) if this path is contained in Rq, where Q is the subsquare 

containing p (Claim 16). Since the set of edges that can be used for type-(j) paths 

is disjoint from the set of edges that can be used for type-(j') paths with j ^ j', 

proving the claims shows the correctness of the algorithm. 

— / 

Claim 10 The number of trees T i including vertex xs for a square S is at most 
the number of rings B of region R l s . 

Proof. The number of trees T i including an edge e — (xs,xy) does not exceed 
c(e). Every tree containing x$ but not a terminal in 7?| takes 2 units of capacity 
on edges (xs, xs'). Every tree containing a terminal in 7?| takes at least one unit of 
capacity on edges (xs, xs>). Since the overall capacity of edges (xs, xs') is at most 
4[log n\ = B, a ring of can be assigned to every tree T • containing x$. ■ 

Claim 11 There are at most B$ trees t\ such that (i) t( contains a vertex of 
region Rq of a subsquare Q and (ii) s, or a request node of T i belongs to S, 
where S is the square containing Q. 

Proof. Assume by contradiction that a subsquare Q does not fulfill the claim. The 

— / 9 
trees T i that contain a vertex of region Rq each use at least one edge with exactly 

one endpoint in Q. Thus there exists a neighboring subsquare Q' of Q in S such 

that more than B$/4 trees t{ contain an edge incident to Q and Q' . It follows 

that S is invalid and the algorithm discards all request nodes in S and all multicasts 

whose source is in S. Thus, none of the trees t( contains a request node or source 

in 5. Contradiction. ■ 

Claim 12 Let S and S' be a pair of adjacent squares. For each multicast i with 
edge (xs,Xs') e t( the algorithm can assign a unique horizontal or vertical 
straightline connection between r$j and ry , . 
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Proof. Since each ring is assigned to at most one multicast, there are at most two 
multicasts that potentially want to use a straightline extension, namely one from 
ring r in S and one from ring r in S'. Since there are two "corners" of ring r, there 
are two straightline extensions from ring r between S and S' and hence each of the 
two multicasts can be assigned a unique one. ■ 

Claim 13 For each gate vertex q of multicast i in a valid subsquare Q the algo- 
rithm can assign a unique horizontal or vertical connection between q and r$j, 
where S is the square containing Q. 

Proof. Since the subsquare Q of q is valid, the edge (q, xs) belongs to at most one 
multicast and the horizontal or vertical connection between q and any ring of R l s 
can be uniquely assigned to this multicast. ■ 

Claim 14 For every neighboring pair (<2i , Q2) of sub squares and for each multi- 
cast i the algorithm can assign a unique horizontal or vertical connection between 
r Qui andr Q2j . 

Proof. The same proof as for Claim 12 applies. ■ 
For the following two claims, let i be a multicast, and let p either be a request 
node in t{ or a gate node such that a request node p' of t{ in S p connects through 
p to St. Let Q be the subsquare of p. If p is a request node, let e p be the edge 

closest to p and incident to Q on the path from p to s, in T i . If p is a gate node, 
let e p be the edge closest to p and incident to Q on the path from p to p' . 

Claim 15 The algorithm can assign a unique horizontal or vertical connection 
between e p to rQj. 

Proof. If p is a request node, then Q is valid since otherwise p would not belong to 
— f 

T i . If p is a gate node, then Q is valid since otherwise the request node connecting 
through p would have been discarded. 

It follows that each edge in Q belongs at most one tree T,, and the algorithm 
assigns the straightline extension from e p to rQj only to this multicast. ■ 

Claim 16 If the path in T, from p to e p or to g(p) is contained in R 2 q, it is edge- 
disjoint from any other multicast tree. 

Proof. The same argument as in Claim 15 shows that each edge in Q belongs at 
most one tree 7Y Thus the edges on the path in T i from p to e p or to g(p) are used 
only for one multicast. ■ 
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4.1.3 Proof of the approximation ratio 

We prove that the algorithm gives an O ((log log n) 2 ) approximation using the 
following steps: Recall that 5 is an upper bound of the optimal fractional solu- 
tion. Lemma 17 shows that the value S is within a constant factor of the opti- 
mal integral solution on set of requests C Lemma 18 shows that the expected 
profit of the trees T i is at least sS/e 2 . Lemma 19 shows that the capacity of no 
edge (xs, xy) is violated with at least constant probability. Lemma 21 shows that 
E[\C l \] > sS/e 2 . Lemma 22 shows that E[\C 2 \] > \C l \/25, Lemma 23 shows 
that E[\C 3 \] > \C 2 \/2, Lemma 24 shows that E[\C 4 \] > |£ 3 |/4, Lemma 25 shows 
that E[\£ 5 \] > |£ 4 |/25, and finally Lemma 26 shows that E[\£ 6 \] > |£ 5 |/4. 

Lemma 17 Let OPT be the optimal integral solution on set of requests C. Then 
OPT < 375. 

Proof. Edge e = (xs,xs f ) has capacity c(e) — [log nj in the fractional packing 
formulation on network G'. The border between square 5 and 5" has size 9B + 1 < 
37 [log n\ . The optimal integral solution for the multicast routing problem on set 
of requests C can be transformed into a feasible solution for the fractional packing 
problem assigning capacity 1 /37. The claim of the lemma follows. ■ 

Lemma 18 Letb = J2t, b ^i)- Then E[b] > sS/e 2 . 

Proof. Note that E[b] equals the expected number of request nodes contained in 
the trees 7\ . 

Let t be a request node of and let x t { — zZ,TeTrt'.eV(T) x t- We snow below 

that 

— SXf 

Pr[t[ e V(Ti)] > -5*-. 

e L 

Summing over all request nodes t- it follows that 

Em^Z-^^En^sxy^^ss/e 2 . 

i t i TeTj 

We are left with showing the bound on Pr[t- £ V(T,-)]. Let {T,- : xj t > 0, i — 
1, .., r} be the set of trees in the fractional solution spanning vertex t[. If r = 1, 
the claim is true, since Pr[t- e V(Tj)] = sx tj . Consider r > 2. The probability 
that vertex t- is in graph G,- is lower bounded by J2j=i r ^r\xj j = 1 andx^ — 
0, Vi ^ j] — >~2j=i r sx t ri(^;(l — sxj.). This expression is minimized when 

all the probabilities have equal value Xj. = -j-. We then have Pr[t- e V(Tj)] > 

r(-p i )(l -p 1 ) r ~ ' > sx t '.exp — (sx t '.fr^) > sx t '.e~ , since sx t < < 1. ■ 
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Lemma 19 For every edge e — (xs,xy) of G' with capacity c(e) — Llog^J. 
Pr[\{Ti : e e E(Tf)}\ > c(e)] < „- tag log tag". 

Proof. Let X(e) = \{T : e e E(T), x T = 1}|. Since X(e) > |{T; : e e £(T,-)}| it 
suffices to show the bound for Pr[X(e) > c (<?)]. 

We prove the claim using the following version of Chernoff bounds [20] : 

Proposition 20 Let X\, Xi, X n be independent Poisson trials such that , for 
1 < i < n, Pr[Xi = 1] = pi, where 0 < p, < 1. Then/or X - Y!i-i x i> 
[x — E[X] — YTi=\ Pi> an d an y 8 > 0, 

Pr[X>(l + 8M<[^ m Y. 

We need to bound Pr[X > (1 + 8)/x] — Pr[X(e) > c(e)], for which we set 
8 = ^ — 1. For the expected value /x we have /x — J2reT-eeE(T) x t — sc ( e )- 
Applying Chernoff bounds we obtain: 



Pr[X(e) > c(e)] < 



■y ( ~ — 1 > -ia 1 

< „~ log log log" 



r e i l 

'-(£M) £ ^ J ~ (f (log log n) 2 ) 



Llog nj 



Lemma 21 E[\C l \] > sS/2e 2 

Proof. By Lemma 19, the capacity of an edge is violated with probability at most 
n~ 3 , since logloglogra > 3. No edge (xs, xs') of network G' is then violated with 
probability at least 1/n. The claim follows. ■ 

Lemma 22 E[\C 2 \] > \C l \/25, 

Proof. Follows from Lemma 9. ■ 

Lemma 23 There is a suitable choice ofc such that for every subsquare Q, Pr[3e e 
V(Rq) : \{fi : e e E(Ji)}\ > 1] < 1/2. 

Proof. Region R 2 q contains O ((log log n) 2 ) edges. Consider edge e of subsquare 
Q and consider the solution of the fractional packing problem. Every edge e of R 2 q 
is assigned to tree T with probability sxj. Since the fractional solution is feasible, 
HteT x t — 1- The "expected capacity" of edge e assigned to a tree Tj is then 
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bounded by s. Using the Markov inequality we have that the probability that the 
capacity of an edge e is exceeded is bounded by s. With a suitable choice of c, the 
probability that the capacity of at least one edge of R 2 q is violated is bounded by 
5 O ((log log «) 2 ) < 1/2. ■ 

Lemma 24 E[\C 4 \] > |£ 3 |/4, 

Proof. We show that the probability that a square is invalid is at most 1/2. Let Q 
and Q' be neighboring subsquares. Let Tqq be the set of trees containing an edge 
between Q and Q' and having x\ > 0. Let /x = 2~ZreT ee / Xj < (9B$ + l)s and let 
X be the number of trees T in Tqq' with xj — 1. X is clearly an upper bound to 
the number of distinct trees crossing the border between two subsquares. Choosing 
a suitable large constant c, we prove in a way similar to Lemma 19 that Pr[X > 
#s/4] < ((log «)~ logloglog "). Since there are 0(logn) 2 adjacent subsquares in a 
square S, and log log log n > 3, with probability at least 1 — £2 (1/ log n) no border 
between two adjacent subsquares is crossed by more than #s/4 distinct trees. A 
request t of multicast i of C? is in C 4 if square S t and square S Sj are valid. The 
theorem then follows. ■ 

Lemma 25 E[\C 5 \] > |£ 4 |/25, 

Proof. Let (s, t) be a request of C A and let q be a gate vertex in S s . Request 
(s, t) belongs to C 5 if q € /?| and g(p) e /?| . By Lemma 9 this happens with 
probability at least 1 /25. ■ 

Lemma 26 £[|£ 6 |] > \C 5 \/4, 

Proof. As shown by Lemma 23 a subsquare is invalid with probability at most 1 /2. 
A request t of multicast i of C 5 is in C 6 if the subsquare S g ( t ) and subsquare 5^^,.) 
are valid. ■ 

We then conclude with the following: 

Lemma 27 There exists an 0((loglog«) 2 ) approximation algorithm for long re- 
quests for the multicast routing problem on meshes. 



4.2 Short requests 

We complete the approximation algorithm for meshes describing the algorithm for 
short requests. 

We separately consider every square S of G and set of requests Ss — {(t, s) : 
S s — S}. We apply recursively the algorithm for requests Ss in a square 5. Square 
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S is partitioned in subsquares of size 0(loglog«). Denote by Q the generic sub- 
square. Those requests of Ss that are considered long requests within square S are 
dealt by the algorithm of the previous section. Those requests of Ss that are con- 
sidered short requests within a subsquare Q are dealt with exhaustive search of the 
best solution for the multicast routing problem in submesh Q of size O (log log n), 
which takes time polynomial in n. 

We then conclude with the following lemma for short requests. 

Lemma 28 There exists an 0((log log log n) 2 ) approximation algorithm for short 
requests for the multicast routing problem on meshes. 

This shows the desired theorem. 

Theorem 29 The above algorithm gives an O ((log log «) 2 ) -factor approximation 
for multicast routing on unit-capacity meshes. 

5 The online algorithm for meshes 

We propose an algorithm with polylogarithmic competitive ratio on meshes. As in 
the unicast algorithm for meshes [14] the algorithm partitions the mesh into squares 
of size 135 x 135, where 5 = 0(log«). Then it uses four main ideas: (1) It "fil- 
ters" requests in stage one to "make space" for our routing, but it guarantees that if 
a square contains requests, then at least one request of them survives the filtering. 
Thus, step one "looses" an 0(log 2 n) factor. (2) Stage two contracts each square to 
a node and runs the algorithm MC of [12] on G' . For each accepted request MC 
returns a path consisting of a sequence of neighboring squares. To translate this 
sequence into a path in the original mesh we have to be able to construct 5 disjoint 
paths between neighboring squares. The idea is that a path from a neighboring 
square enters a square in the "middle" B links between the two squares. Within 
a square each path is assigned its own concentric ring on which it proceeds until 
it reaches either the appropriate row 2 or column to exit the square or its multicast 
tree. (3) However there can be requests accepted by MC which cannot be routed 
"locally", i.e., there is a conflict in the squares of the endpoints. These requests 
have to be rejected. In the unicast setting this causes no problem since the rejection 
of a request does not affect routing of requests accepted later on. In the multicast 
setting, however, MC might output a path in G' that does not connect the request 
to its source in G since an earlier request of the same multicast was accepted by 
MC and rejected by our algorithm. We handle this situation by always connect- 
ing the same squares as MC even if the request is not accepted. (4) Since latter 

2 We use row to denote a horizontal path and column to denote a vertical path in the mesh. 
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requests might be more profitable than earlier ones, the algorithm selects each mul- 
ticast with roughly equal probability (after passing some additional screening for 
"mutability") and discards all unselected multicasts. 

5.1 The first stage of the algorithm 

Let G — (V, E) denotes the n x n two dimensional mesh. We assume that n is 
sufficiently large such that B = l^^-j > 1. 

Let f = n div [log n\ and let f\ — n mod [log n\ . We partition the mesh 
into f 2 submeshes of logarithmic size. The partition of the mesh is obtained seg- 
menting every side into f — fi contiguous segments of size [log n\ followed by f\ 
segments of size flog ri] . By abuse of notation every submesh is called a square, 
even though the size of the two sides may differ by 1. We denote the square con- 
taining a node t by S t . 

The first ring in a square 5 consists of all nodes of S that either are incident 
to a node outside of S or have degree less than 4 in the mesh G. Recursively, the 
i-th ring of S with i > 1 consists of all nodes of S that are incident to a node of 
ring i — 1 of S. Each ring, except the innermost, forms a rectangle of nodes, the 
innermost ring either forms a rectangle or a line of nodes. A corner of a ring is a 
corner of the rectangle or an endpoint of the line. The first ring is also called the 
border of S. 

In any square S we define three regions R l , R 2 and R 3 . Region R l consists of 
rings 1 to 2B, region R 2 consists of rings 2B + 1 to 4B, and region R 3 is formed 
by rings 4B + 1 to 6B and the remaining piece of S, called the central region of S. 
The central region is a rectangle with sides of size at least B, i.e., consisting of at 
least B rings. 

The first stage of the algorithm selects for each square one of its regions at 
random. The selected region is used to route paths "through" the square. All 
requests whose request node or source belong to the selected region are rejected to 
guarantee that they do not overlap with the paths routed through the selected region. 
Additionally each ring of the square is randomly dedicated either to sources or to 
request nodes and requests not following the dedication are rejected. Again the 
idea is to guarantee that requests with a source in a ring does not overlap with a 
request with a request node in the same ring. 

The details of the first stage are as follows: 

1. Dedicate each ring to multicast sources with probability 1/2, otherwise to 
request nodes. 

2. Select uniformly at random one of the three regions in each square. 
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3. Discard all the requests from vertex t to source s if t or s are in a selected 
region. 

4. Discard all the requests from a vertex t on a ring dedicated to sources, unless 
the request is directed to a source s on the same ring of t. 

5. Discard all the multicasts whose source is in a ring dedicated to requests. 

Let the original sequence of requests be called A and let the remaining set of 
requests be called C. Denote by OPT (A) the sequence of requests accepted by 
the optimal algorithm over a set of requests A. 

Lemma 30 For any input sequence a , E(\0 PT{C)\) > OPT(A)\. 

Proof. Let us consider the probability that a request (t, s) of OPT (A) belongs to 
C. Assume first that both t and s are within the same square. Then (t, s) belongs 
to C if (a) both are outside the selected region, and (b) the ring of s is dedicated to 
sources and the ring of t is either dedicated to requests or equal to s's ring. The 
condition (a) is fulfilled with probability at least 1/3, condition (b) with probability 
at least 1 /4. Thus, (t, s) belongs to C with probability at least 1/12. 

Assume next that t and s belong to different squares. Then (t, s) belongs to 
C if (a) both are outside the selected region, and (b) the ring of s is dedicated to 
sources and the ring of t is dedicated to requests. The condition (a) is fulfilled with 
probability at least 4/9, condition (b) with probability at least 1/4. Thus, (t,s) 
belongs to C with probability at least 1 /9. 

This shows that E[\OPT(A) n C\] > ±\OPT(A)\. Since \OPT(C)\ > 
\OPT(A) nC|,the result follows. " ■ 

5.2 The second stage of the algorithm. 

The second stage of the algorithm receives as input the requests of C accepted by 
the first stage, in the order in which they are presented to the algorithm. It partitions 
C into the set Co of long requests, and the set So of short requests. A request (t,s) 
is a long request if at presentation no branch of the multicast rooted at s is in S t . 
Otherwise, (t, s) is a short request. The algorithm routes short requests "locally" 
within the square and uses MC for long requests. 

Recall that each square contains 13S concentric rings. To guarantee that the 
trees used for different multicasts are edge-disjoint we maintain the invariant that 

(II) all edges of a ring that belong to any multicast tree belong to the same multi- 
cast tree. 
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To maintain the invariant each ring is assigned by the algorithm to at most one mul- 
ticast and this is the only multicast whose tree is allowed to use edges of the ring. 
To achieve this each request of C has to pass various tests. These tests guarantee 
that the following additional invariants are maintained. 

(12) No two request nodes of accepted requests belong to the same square. 

(13) No two sources of accepted requests of different multicasts belong to the same 

ring. 

(14) No two sources of accepted long requests of different multicasts belong to the 

same square. 

(15) No two sources of accepted short requests from the same square belong to the 

same square. 

(16) No two request nodes of accepted long requests belong to the same square. 
Invariant (16) follow from invariant (12). 

5.2.1 Long requests 

The algorithm for long requests decides whether to accept or reject a request in 
four steps. Each step rejects the request if certain conditions are not fulfilled. The 
requests which are not rejected after step i, i — 1, 2, 3, 4, form a sequence £,-. 

Whenever the first request of a multicast is added to £2, the algorithm decides 
whether the multicast is selected for long requests. This is needed (1) to discard 
multicasts were the "local" routing causes potential conflicts and (2) to guarantee 
that latter multicasts have roughly the same probability of being accepted as earlier 
ones. A multicast with source s is selected for long requests if all of the following 
conditions are fulfilled at the time of the test: 

(i) no multicast with source on the ring of s is already selected for short requests; 

(ii) no multicast with source in S s is already selected for long requests; 
(Hi) a coin toss with success probability 1/(45) is successful; and 

(iv) if s is in R 3 then the largest ring of R 3 in S s is dedicated to sources. 

When a multicast becomes selected for long requests, up to two of the rings 
in S s are assigned to the multicast: (a) the ring containing s is assigned to the 
multicast; and (b) if s is in R 3 , the largest ring of R 3 in S s . 
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The second ring is needed for the following reason: When routing a long re- 
quest to the border of S s the algorithm needs to be able to use any available crossbar 
row or column. However, if s belongs to the central region, the ring of s does not in- 
tersect all crossbar rows and columns. Thus, we route the path from the ring of s to 
the largest ring of R 3 and from this ring the algorithm can connect to any crossbar 
row or column. To avoid that the connection between the ring of s and the largest 
ring of R 3 overlaps with any other multicast tree, the two rings are connected along 
a "straightline extension" of the internal ring assigned to s, which will guarantee 
that the connection does not use an edge of a short request (see Section 5.2.2). 

However, the connection of s from the central region to the largest ring of R 3 
overlaps a crossbar row or column that may be used to connect a request node 
t' e R 3 of a different long request (t', s') to the border. Our algorithm will reject 
all long requests from a request node in the square if (i) a request to a source node 
in the square has been accepted, or (ii) if a long request to a request node in the 
square has been accepted. This implies that if a connection from a source in R 3 to 
the largest ring of R 3 exists then no further long requests are accepted. Thus, the 
problem is restricted to the situation when the long request from a request node in 
R 3 was accepted and afterwards a request with source in R 3 appears. We avoid the 
intersection by selecting an appropriate straightline extension connecting a ring of 
the central region to the largest ring of R 3 that avoids the crossbar row or column 
used by the long request with request node in R 3 . 

We now give the details of the decision algorithm when a request (t, s) arrives. 
Let G' be a mesh such that each square of the original mesh is represented by a ver- 
tex in G' and two vertices of G' are connected by an edge if the two corresponding 
squares are adjacent. Each edge has capacity B. 

1. If a long request with request node or source in S t has been added to £3, the 

algorithm rejects (t, s) and stops. If a short request with request node in S t 
has been accepted, the algorithm rejects (t, s) and stops. Otherwise it adds 
the request to C\. 

2. The request (t, s) of C\ is transformed into a request between the two vertices 

S t and S s of G' , and then submitted to MC. If MC accepts the transformed 
request, request (t, s) is added to £2. In this case MC also returns a route in 
G' which corresponds to a sequence of squares in the original mesh. Other- 
wise, the request is rejected and the algorithm stops. 

3. If the multicast of the request (t, s) in £2 is selected for long requests, the 

request is added to £3 and an unassigned ring of the selected region of S t is 
assigned to the multicast of the request. Otherwise the algorithm rejects the 
request and stops. 
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4. If t is not in the central region of S t , then (t, s) is added to £4. 

If t belongs to the central region of S t , and one of rings 46 + 1, . . . , 66 in 
S t is dedicated to request nodes then (t, s) is added to £4 and one of rings 
46 + 1, . . . , 66 in S t dedicated to request nodes is assigned to the multicast. 

If (t, s) is added to £4 it is accepted. The ring of t is assigned to the multicast 
of (t, s) and t is connected to the multicast tree of s. Otherwise the request 
is rejected and an arbitrary node u on the assigned ring of the selected region 
is connected to the multicast tree of s. 

Step 4 guarantees that the following invariant is maintained: 
(17) If a request (t, s) is added to £3, then a node of S f is connected to s. 

We show next how the algorithm routes a request of £3. For each such request s 
denotes the source node and v denotes the node of S t to which the request is routed, 
i.e., either v — t or v — u, where u is a node on the assigned ring of the selected 
region. For every such request we are given by MC a path in G' consisting of a 
sequence of neighboring squares. Let S\, S2, . . . , S p be the sequence of squares 
such that Si — S f and S p contains a node x of the multicast tree of s. If S p contains 
s, let x = s. Otherwise, if S p contains an accepted request node not in R 3 , let x be 
this node. Otherwise, as we show in Lemma 34, a ring of the selected region was 
assigned to the multicast and at least one node on the ring is connected to s. In this 
case let x be this node. 

Each request of £3 is routed from v to the border between Si and S2, from there 
to the border between S2 and S3 and so forth, until finally to the border between 
S p -i and S p and from there to x. Let P denote the resulting path from v to x. We 
describe next each step in detail. 

To route paths between two neighboring squares we reserve the crossbar of 
rows 66 + 1, . . . , IB and columns 66 + 1, ... , 76. All rows and columns in the 
crossbar cross the central region of the square. By Lemma 34 there is an unassigned 
crossbar row resp. column for each accepted request. 

Case 1: From v to the border between S\ and S2: Wlog the border between Si 
and S2 is a row. We assign an unassigned crossbar column and the ring of v 
to the multicast. 

If v does not belong to the central region, we route P along the ring of v until 
it reaches the point on its assigned crossbar column closest to the border. At 
this point P is routed along the assigned column until it reaches the border. 

If v belongs to the central region, we route P on the ring of v until a corner 
of the ring is reached. There P continues straight to the assigned ring of R 3 . 
From the assigned ring of R 3 we continue as above. 
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Case 2: From one border of a square to another border of the square: Wlog the 
entering border is a column and the exiting border is a row. One unassigned 
ring of the selected region is assigned to the multicast. We additionally as- 
sign one of the unassigned crossbar columns between the current square and 
the next square to P. 

Path P follows the entering row until it intersects the assigned ring. Then 
P is routed along the ring until it reaches the intersection point with the as- 
signed column closest to the exiting border. There P switches to the assigned 
column until it reaches the border. 

Case 3: From the border of S p to the node x: Wlog the border between S p -i 
and S p is a row. There are three cases to consider: (i) x = s, (ii) x $ R 3 
is a request node of an accepted request of the multicast, or (iii) x is a node 
connected to s that belongs to a ring of the selected region assigned to the 
multicast. 

(i) If s does not belong to R 3 , we route P along the entering column until 
the ring of s is reached. At this point P is routed along the ring of s to s . Us 
belongs to R 3 , P follows the entering column until the largest ring of R 3 is 
reached. Then, P is routed along this ring until it reaches a point from which 
a row or column is available that (1) connects straight to a corner of the ring 
of s and (2) does not intersect any existing multicast tree. By Lemma 34 
such a row or column exists. Path P routes along this row or column to the 
ring of s and along the ring of s to s. 

(ii) When the request at x g R 3 was accepted, the ring of x was assigned to 
the multicast. We route P along the entering column until the ring of x is 
reached. At this point P is routed along the ring of x to x. 

(iii) We route P along the entering column until the assigned ring of the 
selected region is reached. At this point P is routed along the ring to x. 

We need to show that this routing is always possible. We first show two prop- 
erties of MC. 

Lemma 31 The maximum number of paths routed by MC between two adjacent 
squares is B. 

Proof. Each edge in G' has capacity B and MC does not violate the capacity 
constraints. ■ We say a path in G' is routed through a square S if S is an internal 

node of the path. 
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Lemma 32 Let S be a square. There are at most 2B paths of MC routed through 
S.Ifa request node of a request in C2 belongs to S, then at most 2B — 1 paths of 
MC are routed through S. 

Proof. Every path routed through a square consumes two units of bandwidth on 
the edges incident to the vertex of G' associated with the square. Additionally, a 
long request consumes one unit of bandwidth. The overall bandwidth on the edges 
incident to a vertex of G' is 4B. ■ 

We prove with the next two lemmata that all paths constructed for different 
multicasts are edge-disjoint. 

Lemma 33 Invariants (14) and (16) are maintained. 

Proof. Condition ( ii) of being selected for long requests guarantees that invariant 
(14) is maintained. Step 1 of the algorithm guarantees invariant (16). ■ 

Lemma 34 Let (t, s) be a request of £3 and let (v, x) be the corresponding pair 
of nodes that has to be connected by a path. Then the above algorithm succeeds in 
constructing a path from v to x. Furthermore, this path does not share any edge 
with paths constructed for requests in C^for different multicasts. 

Proof. We first show that the algorithm succeeds in constructing a path from v to x. 
We need to show that (a) whenever the algorithm tries to assign an unassigned ring 
of the selected region to the multicast, such a ring is available; (b) if s $ S p and 
no request node of this multicast in R l or R 2 was accepted then s is connected to 
a node x on a ring of the selected region of S p that is assigned to the multicast; (c) 
whenever the algorithm tries to assign an unassigned column or row between two 
neighboring squares, such a column or row is available; (d) whenever the algorithm 
tries to connect the largest ring of R 3 with vertex s in R 3 , such connection is 
possible without overlapping with a different multicast tree. 

(a) The selected region consists of 2B rings. The algorithm tries to assign a 
ring of the selected region for each request of £3 with request node in the square 
and for each request routed through a square. Step (1) of the algorithm guarantees 
that at most one long request with request node in a square belongs to £3. If this 
happens, Lemma 32 shows that at most 2B — 1 paths are routed through the square, 
otherwise at most 2B paths are routed through the square. Thus, in either case there 
are enough rings in the selected region. 

(b) If s g S p but MC routed the path from S t to S p , then MC connected S p to 
S s during an earlier request. Thus, an earlier request (t' , s) of the same multicast 
was added to £2 and, since the multicast is selected for long requests, was also 
added to £3. Thus, a ring r of the selected region of S p is assigned to the multicast. 
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If (t', s) e £3 \ £4, then a node u of r is connected to s. If (?', 5) e £4, then by 
assumption e R 3 . Thus, the crossbar row or column on the path from t' to the 
border of S p also contains a node of r. 

(c) The crossbar consists of B rows and columns. By Lemma 31 there are 
at most B requests that have to cross the border between any two neighboring 
squares. Thus, it is guaranteed that whenever a path needs to be routed between 
two neighboring squares an unassigned row or column is available. 

(d) The ring of s has at least one corner, and thus there are at least two possible 
rows and two possible columns through which the largest ring of R 3 can be con- 
nected with the ring of s. At most one long request (t', s') with t' in R 3 has been 
accepted in S s , potentially using one of these crossbar rows or columns. Thus, 
there are still at least three possible ways to connect to the ring of s. 

Let S be a square. We show next that the path from v to x does not share any 
edge in 5 with paths constructed for requests in £3 for different multicasts. The 
lemma follows. 

Note first that no ring of S shares an edge with a crossbar row or column or a 
straightline segment connecting two rings. To complete the proof that no two paths 
of different multicasts share an edge in S, it then suffices to show that the rings of 
the two paths are edge-disjoint and that the straightline segments of the two paths 
are edge-disjoint. The straightline segments connect the various rings of a path and 
a ring with the border of S. 

A path that is routed through 5 consists of an entering crossbar row or column, 
a part of a ring in the selected region, and a departing crossbar row or column. 
Both crossbar pieces go from the border to the selected region. By points (a) and 
(c) and the observation that the straightline extensions belong to R 3 , which is in 
this case not the selection region, all these three pieces are disjoint from the paths 
used by any other request path with edges in S. Thus, any path routed through S is 
edge-disjoint from any path of a different multicast in S. 

We are left with showing that no two paths of long requests starting or ending 
in 5 share an edge in S. Let P and P' be two such paths belonging to different 
multicasts and let y and y' be their endpoints in 5. By invariants (14) and (16), 
y and y' can only be a source node and a non source node. Wlog., y is a source 
and y' is a non-source node. Node y belongs to a ring that is assigned to sources 
and does not belong to the selected region, node y' either belongs to the selected 
region or to a ring that is assigned to requests. In either case, the rings of y and of 
y' are edge-disjoint. Furthermore, the largest ring of R 3 might have been assigned 
to path P and in this case is dedicated to sources. A different ring of R 3 , which is 
dedicated to request nodes, might have been assigned to path P'. Again these rings 
are edge-disjoint. 

By point (c) the crossbar rows or columns assigned to P and P' are different 
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and, thus, edge-disjoint. Since the straightline extensions start at different rings, 
they are edge-disjoint. We are left to show that the straightline extensions con- 
necting the rings assigned to y and y' are edge-disjoint with the crossbar rows or 
columns used in P and P'. If neither y nor y' belong to R , no straightline ex- 
tension is necessary. If only one between y and y' belongs to R 3 , one straightline 
extension exists, belonging completely to R 3 , and the crossbar row or column of 
the other path does not reach R 3 . Thus, of P and P' are edge-disjoint. If both y 
and y' belong to R 3 , note that by Step 1 P' was accepted and routed before P. By 
point (d) above the straightline extension of P does not overlap with P' . 

It follows that P and P' are edge-disjoint. ■ 

5.2.2 Short Requests 

Recall that a request (t,s) is short if a node x in S t belongs to the multicast tree of 
s. The algorithm for short requests decides whether to accept or reject a request in 
three steps. Each step rejects the request if certain conditions are not fulfilled. The 
requests which are not rejected after step i,i — 1,2,3 form a sequence Sj. 

Whenever the first request with S t — S s of a multicast is added to S2 , the 
algorithm decides whether the multicast is selected for short requests. Note that 
this decision will only affect short requests with S t — S s , other short requests of 
the same multicast are accepted or rejected independent of this decision. 

A multicast with source s is selected for short requests if all of the following 
conditions are fulfilled at the time of the test: 

C i) no short request of a multicast with source in S s has been added to £2 ; 
(ii) a coin toss with success probability 1/2 is successful. 

Whenever a multicast is selected for short requests, the ring of s is assigned to the 
multicast. 

In the following let y denote a node of S t that belongs to the multicast tree of 
s. Note that y — s is possible. The decision part of the algorithm for short requests 
consists of three steps: 

1. (a) If a short request with request node in S t has been accepted then reject (t, s) 

and stop, (b) If a long request with request node in S t has been added to £3, 
reject (t, s) and stop, (c) If a long request with source in S t has been added 
to £3, reject (t, s) and stop. Otherwise add (t, s) to Si. 

2. If either t or y, but not both, is in the central region, the other vertex is not in 

R 3 , and ring 4B + 1 to 65 of St are all dedicated to sources, then reject (t , s) 
and stop. Otherwise add (?, s) to S2. 
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3. If S t ^ S s or if S t — S s and the multicast with source s is selected for short 
requests, then add (t, s) to S3. Otherwise reject (t, s) and stop. 

Accept every request (t,s) in S3 and assign the ring of t to the multicast. If 
either t or y, but not both, belong to the central region of S t , and the other 
vertex does not belong to R 3 , assign one of rings 45+ 1 to 65 of S t dedicated 
to requests to the multicast of s. 

We describe next the routing part of the algorithm. Let P denote the path used 
by an accepted request (t, s) to connect t to y. If f and y belong to the same ring, 
simply connect them by a path along the ring. 

If t and y do not belong to the same ring, let a denote the vertex of {t, y} that 
belongs to a ring of S t with larger index than the other vertex. Let b denote the 
other vertex. If a is outside the central region then P starts at a and follows the 
ring containing a until a corner of the ring is reached. Then, P continues along a 
straightline extension of the ring to the ring of b and from there along the ring of b 
to b. 

If a is in the central region and b is not, then P starts at a, follows the ring 
of a until a corner of the ring is reached. There P continue straight to the ring in 
the external 25 rings of R 3 assigned to the multicast. Then, path P continues as 
described above, i.e., it routes to a corner of the ring, extends straight from there 
until it reaches the ring of b, and follows the ring of b to b. 

If both a and b belong to the central region, P is routed from a along the ring 
of a until it reaches a corner of the ring. From there it continues straight until it 
reaches the ring of b and it follows the ring of b to b. 

Lemma 35 Invariant (II), (12), (13), and (15) hold. 

Proof. Step 1(a) and 1(b) of the algorithm for short requests and step 1 of the 
algorithm for long requests ensure that invariant (12) holds. 

Condition ( i) and ( ii) of being selected for a long request and condition ( i) of 
being selected for a short request ensure invariant (13). 

Condition (i) of being selected for a short request ensure invariant (15). 

Finally, we show that invariant (II) holds. The routing phase guarantees that 
edges on a ring are only used by the multicast to whom the ring is assigned. Thus, 
we need to show that each ring is assigned to at most one multicast. A ring is 
assigned to a multicast during request (t, s) (a) if it contains s, (b) if it contains t, 
(c) if it belongs to the selected region, (d) if it is the largest ring of S s and dedicated 
to sources or, (e) if it is one of rings 45 + 1 to 65 in S t and dedicated to requests. 

During a request (t, s) a type (c) assignment is only done if the ring was not 
previously assigned. Thus, we only need to show that type (a), (b), (d), and (e) 
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assignments do not assign previously assigned rings. Recall that each ring is either 
dedicated to a source or to a request node. A request is rejected if its source lies 
on a ring dedicated to request nodes or its request node lies on a ring dedicated to 
sources and its source does not lie on this ring. In the following let r denote the 
ring that is assigned. 

Type (a): The ring r of s is dedicated to sources. When the ring r of s is 
assigned to the multicast with source s, the multicast with source s is selected for 
long or short requests. If it becomes selected for long requests, by condition ( i) and 
( ii) for selection for long requests no multicast with source on r is already selected 
for short or long requests. If the multicast becomes selected for short requests, 
note that S s — S t . By by condition (i) for selection for short requests and step lc of 
the algorithm for short requests no multicast with source in S s is already selected 
for short or long requests. Thus, r was not assigned to any other multicast with 
source on r. Furthermore, r does not belong to the selected region. Thus, r has 
not been assigned because of type (a), (b), (c), or (e) assignments. Finally a type 
(d) assignment can happen only if a previous source of a long request belongs to 
S s . This is impossible by condition (ii) of being selected for a long request if the 
current request (t, s) is a long request, and by step (lc) of the algorithm for short 
requests if the current request (t, s) is a short request. 

Type (b): The ring r is dedicated to request nodes and does not belong to 
the selected region. Thus, r was not previously assigned with type (a), (c), and 
(d) assignments. Since (t, s) is an accepted short request, no earlier request with 
request node in S t was accepted by invariant (12). Thus, r was not previously 
assigned by type (b) or (e) assignments. 

Type ( d): In this case ring r belongs to S s and is dedicated to sources. Further- 
more, s e R 3 , i.e., R 3 is not selected and the current multicast is selected for long 
requests. It follows that no multicast with source in S s is already selected for long 
requests and no multicast with source on r was previously selected. The former 
implies that no type (d) assignment has happened to r before, the latter implies that 
no type (a) assignment has happened to r before. Since r does not belong to the 
selected region and is dedicated to sources, type (b), (c), and (e) assignments are 
not possible. 

Type (e): In this case ring r belongs to S t and is dedicated to request nodes. 
Furthermore, R 3 is not selected and t e R 3 . It follows that type (a), (c), and (d) 
assignments are not possible. By invariant (12) no earlier request with request node 
in S t was accepted. Thus, no previous type (b) or (e) assignment can have occurred. 



Lemma 36 No two paths constructed for different multicasts share an edge. 
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Proof. Let 5 be a square. We show that no two paths P and P' constructed for 
different multicasts share an edge in S. By Lemma 34 no two paths assigned to 
long requests share an edge. Thus, we can assume that P is a path constructed for 
a short request. Let (t, s) be the request leading to the construction of P and let 
it' , s') be the request leading to the construction of P' . 

By step (la) of the algorithm, only one short request is accepted in a square. 
By step (lb) of the algorithm, if a long request from a request node in a square has 
been added to £3, no short request from that square is accepted. By step (lc) of the 
algorithm if a long request with source node in a square was added to £3, no short 
request from that square is accepted. Thus, it follows that when (t, s) is accepted, 
no previously accepted request has either its source or its request node in S t . Thus, 
either P' routes through S t or (t', s') is a long request accepted after (t, s). By step 
1 of the algorithm for long requests it additionally follows that S t ^ 5;', i.e., that 
St — S s >. 

Lemma 35 shows that we maintain invariant (II), i.e., the edges of a given ring 
are used by at most one multicast tree. Thus, no two different multicast trees share 
an edge on a ring. As argued in Lemma 34, no ring of S shares an edge with a 
crossbar row or column or a straightline segment connecting two rings. 

We are left with proving that the subpaths of P' used to connect the (at most 
2) rings of P' and the border(s) of the square are edge-disjoint with the (at most 2) 
subpaths of P that connect the (at most 3) rings used by P. 

Each subpath used by P is a straightline extension either (a) from a corner of a 
ring outside the central region to a more external ring, (b) from a corner of a ring of 
the central region to a ring of R 3 with smaller index, or (c) from a corner of a ring 
of the central region to another ring of the central region. Each subpath used by P' 
is either (i) a straightline extension from a corner of a ring of R 3 to ring 4B + 1, 
or (ii) a crossbar row or column from a vertex of ring 45 + 1 or of a ring outside 
R 3 to the border of the square. Since they start from different rings, every type 
(i) subpath of P' is edge-disjoint with all type (a), (b), or (c) subpaths used by P. 
Every type (ii) subpath of P' is edge disjoint with any type (a) subpath of P, since 
a type (a) subpath does not use edges on the crossbar. Every type (ii) subpath of 
P' is also edge-disjoint with any type (b) or (c) subpath of P, since type (b) or (c) 
subpaths only use edges with both endpoints in R 3 . 

Thus, P and P' are edge-disjoint. ■ 

5.2.3 The analysis 

Let p = log n (log n + log log M) log M.. Recall that MC achieves a competitive 
ratio of O(p). We prove in this section that the expected number of requests ac- 
cepted by the second stage of the algorithm is an O(plogn) fraction of OPT(C), 
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for any possible set C. Together with Lemma 30 it follows that our algorithm is 
0(p log n) = O (log 2 n (log n + log log M) log M) competitive. 
Since 

\OPT(C)\= \OPT(C)f)C Q \ + \OPT(C)nS Q \ 

= \OPT(C) n ((£ 0 \ Ci) U (<S 0 \ + \OPT(C) n Al + |OPT(C) n 5i 

it suffices to show the following results: 

\OPT(C) n ((£ 0 \£i) U (<S 0 \<Si))| < 481og 2 ?i£[|6W(C)|] (Lemma 39), 

\OPT{C)r\d\ < 0(p log n)E[\ON(C)\] (Lemma 40), and 

\OPT(p)nSi\ < O (log 2 n)E[\ON (C)\] (Lemma 42). 

We first need to show the following claim. We assume here that each node can 
receive at most one request per multicast. 

Claim 37 At most 4 log 2 n requests with request node in a given square can be 
accepted in a solution. 

Proof. Each node in a square is incident to four edges and thus can belong to at 
most four multicast trees. Thus at most 41og 2 « requests with request node in a 
square can be accepted by a solution. ■ 

Claim 38 Every request of £3 is added to £4 with probability at least 1/2, i.e., 
E[\U\\ > |£ 3 |/2. 

Proof. Let (t, s) be such a request of £3. By Step 4 of the algorithm for long 
requests, (t, s) is not added to £4 if t belongs to the central region and rings 4B + 1 
to 6B are all dedicated to sources. If t belongs to R 3 , R 3 is not the selected region, 
and thus every ring of R 3 is dedicated to request nodes with probability 1/2. ■ 

Lemma 39 \OPT(C) n ((£ 0 \ £1) U (<S 0 \<Si))l < 4log 2 nE[\ ON (C)\] 

Proof. We first show that \OPT(C) f) (£ 0 \ £i)| < 24log 2 nE[\ON(C)\]. 

A request (t, s) of £0 is not added to C\ because (i) a long request with request 
node in 5; was added to £3 or (ii) a long request with source in 5 ( was added to £3 
or (iii) a short request with request node in S t was accepted. Consider the following 
charging from long requests in £o\£i to squares of the mesh: (A) If a request (t, s) 
of £0 was not added to C\ because situation (i) or (iii) arose, the request is charged 
to S t . (B) If (t, s) was not added because a type (ii) request (t' , s') was added to 
£3, we charge (t, s) to S t i. Claim 37 shows that there are at most 4 log 2 n type (A) 
charges and at most 4 log 2 n type (B) charges to a given square. 
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Let q be the number of squares to which at least one request of Co \ L\ is 
charged. Then \OPT(C) Pi (Co \ < Slog 2 nq. It suffices to prove q < 
3E[\ON(C)\] 

Every square with a type (A) charge contains the request node of an accepted 
request of C. Every square with a type (B) charge contains the request node of 
a request in £ 3 . By Claim 38, |£ 3 | < 2E[\ON(C)\]. Thus, q < \ON(C)\ + 
2E[\ON(C)\l 

Then, \ OPT(C) n OC 0 \A)I < 81og 2 «^ < 24 log 2 nE[\ ON(C)\] 

The same argument applies to So \ 5 1 ■ ■ 

Lemma 40 \OPT(C)C\C x \ < 0(plogn)E[\ON(C)\] 

Proof. The requests of C\ are transformed into requests on G', giving rise to a 
request sequence £i,g' submitted to MC. The bandwidth on M is a factor of 13 
larger than the bandwidth on G' . On the request sequence we compare MC 
on G' with an optimum algorithm on a mesh G" whose topology is identical to G' 
and whose edge capacity is a factor of 13 larger. Then we compare the optimum 
algorithm on G" with the request sequence C\^& with the optimum algorithm on 
M with the request sequence C\. Note that ONmc(£i,g') = £-2- 

The same arguments as in [4, 14] show that | OPT(C hG »)\ < 0(p)E[\ ON(C hG >)\] 
- 0(p)E[\C 2 \l 

Note that \ 0 PT(Ci, G »)\ > \OPT(C\)\, since routing requests on G" is iden- 
tical to routing requests in M where all edges inside a square have infinite capacity. 
Thus, 

\OPT(C)C\Ci\ < \OPT(d)\ - 0(p)E[\C 2 \l 

Claim 41 below shows that \£ 2 \ < 0(log «)£[|£3|]. From Claim 38 we obtain 
that |£ 3 | < 2£[|£ 4 |]. Since |£ 4 | < |O^V(C)|, this concludes the proof of the 
lemma. ■ 

Claim 41 \C 2 \ < O (log n)E[\C 3 \] 

Proof. A request of C 2 is added to £3 if its multicast is selected for long requests. 
We show that a multicast is selected for long requests with probability Q (1/ log n). 
This shows the claim. 

A multicast is selected for long requests if conditions (i) - (iv) hold. Let r be 
the ring of of the source s of the multicast. We bound next the probability that each 
condition holds. 

Condition ( i): The first short request with source on r added to £2 is selected for 
short requests with probability at most 1 /2. None of the short requests with source 
on r added later to £2 can become selected for short requests. Thus, condition ( i) 
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that no source on r is already selected for short requests holds with probability at 
least 1 /2. 

Condition (ii): The total capacity of edges incident to S s is 45. Thus there 
exist at most 45 multicasts with source in S s in £2- Each of them is selected with 
probability at most l/(4B). Thus the probability that condition (ii) holds, i.e., that 
none of them is selected, is at least (1 — 1/(45)) 4B > l/e. 

Condition (Hi): The coin toss is successful with probability 1/(45). 

Condition (iv): The largest ring of R 3 in S s is dedicated to sources with proba- 
bility 1/2. 

Thus a multicast is selected for long requests with probability at least l/(16e5) = 
£2(1/ log «). ■ 

Lemma 42 \OPT(C)r\S x \ = <9(log 2 n)E[\ON(C)\] 

Proof. Since \OPT(C) n «Si| = \OPT(C) n (Si \ S 3 )\ + \OPT(C) n <S 3 | and 
\OPT(C) n <S 3 | < \ ON(C)\ it suffices to show 

\OPT(C)n(S i \S 3 )\<32\og 2 nE[\ON(C)\]. 

Let S be a square and let C n 5 denote the sequence C restricted to the requests 
with either request or source node in 5. We show that for each square 5 that 

\0PT(Cns)n(S l \s 3 )\ < mog 2 nE[\ON(C) n s\i 

Since 

\OPT(C)n(S { \s 3 )\ < ^|OPr(Cn5)n(<s 1 \<S 3 )| 

s 

< ^log 2 nE[\ON(C) n S\] 

s 

< 321og 2 £[|O^V(C)|]. 

By Claim 37, 

I opt(C n s) n (Si \s 3 )\ < \ opt(Cds)\ <4iog 2 «. 

It suffices to show that 

(*) For each square S with Si n S ^ 0, E[\ON(C) n 5|] > 1/4. Consider 
the first request (t, s) of S\ n S. It is added to £2 if t and y either both belong to 
the central region or both not belong to the central region. Otherwise, the request 
is added to £2 if at least one of rings 45 + 1 to 65 is dedicated to request nodes. 
This happens with probability at least 1/2. 
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If y ^ s, then the request is added to £3 and the claim is proved. If y — s, 
then the request is added to £3 if the multicast is selected for short requests. If 
Condition (i) of being selected for short request is not satisfied, there exists a short 
request (t' , s') of £2 with 5,' — S for which Condition (i) of being selected for 
short requests was satisfied, that has been added to £3 with probability 1 /2. If 
Condition (i) of being selected for short requests holds for source s, then (t, s) is 
added to S3 with probability 1/2. Thus, with probability 1/2 at least one request 
of S2 is added to S3. This shows that if S\ n S ^ 0, then a request of C n S is 
accepted by the online algorithm with probability at least 1/4. ■ 

Lemmata 39, 40 and 42 together prove the following theorem: 

Theorem 43 There exists a O (log 2 n (log n + log log .M) log .M) competitive al- 
gorithm for multicast routing on unit capacity meshes. 

5.3 A lower bound for the online algorithm on meshes 

In [12] a lower bound of Q(\og{M./u)\ogn) against an oblivious adversary is 
given for the online multicast algorithm in a tree, where u is the (minimum) edge 
capacity, A4 is the number of multicasts, and n is the size of the tree. We show 
here how to modify the proof to give a lower bound of Q((log(M/u) log n)/d) 
in a connected graph whose minimum degree is d. This gives a polylogarithmic 
lower bound for a mesh. 

We assume first that all demands and all edge capacities are 1 and prove a lower 
bound of £2(log.M \ogn). Afterwards we show how to add edge capacities to the 
proof. 

We restrict ourself to the case that «Jn > logA4. Let M — A4/log« and 
N = n/logM. Assume that the nodes are numbered with consecutive numbers 
from 0 to n — 1 such that the node with minimum degree is labeled 0. Consider the 
following sequence of multicast requests. There are log N phases. In the following 
we describe phase i, with 1 < i < log N. In each phase the adversary generates 
M new multicasts. Each multicast has source 0. Its requests consists of log M sets. 
The first set of requests consists of the nodes 1 to 2 l . Then the adversary flips a coin 
and terminates the multicast with probability 1/2. The j-th set of requests consists 
of nodes 2 l * ( j — 1) + 1 to 2 1 * j. After processing all M multicasts a new phase 
starts. 

Let c(i) be the capacity on any edge incident to node 0 used by the online 
algorithm during phase i. Also, let p(i) be the profit obtained by the online algo- 
rithm during the j-th phase. We use c*(z) and p*(i) to denote the corresponding 
quantities for the optimal algorithm. 
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By contradiction we show below that there exists a phase k such that Xa<i<£ E[p(i)] < 
2 k+l d/ log N. The adversary stops the sequence of requests after this phase k. 
Claim 8.2 of [12] proves that during phase k the optimal algorithm can obtain a 
profit of at least (log M/4)2 k by accepting the multicast with highest profit in this 
round. This gives a lower bound of Q ((log M log N)/d) = Q ((log M log n)/d). 

We are left with showing that there exists aphase k such that Xa<(<& E[p(i)] < 
2 k+l d/logN. Assume by contradiction that no such phase exists. Let S(k) = 
(1/2*) Ei<,-<* E[p(i)]. Then for all*, S(k) > 2d/logN, i.e., £i<*<i og iv S(k) > 
2d. But 

l<k<\ogN \<k<logN l<i<k l<i<logiV 

Claim 8.1 of [12] shows that E[p(i)] < T E[c(i)l Since c (0 < d > il follows 
that Ei<yt<io g iv 5 W < Ei</<iogJV 2£ t c (0] < 2rf, which gives a contradiction. 

If the edge capacity is u, each phase is repeated u times. This increases the 
number of multicasts by a factor of u. Thus, the same proof as above gives a 
lower bound of Q ((log(M/u) log n)/d). We summarize the result in the following 
theorem. 

Theorem 44 No randomized algorithm for online multicast routing on a con- 
nected graph with minimum degree d > 0 can have a competitive ratio better 
than £2 ( (log (.M/m) log n)/d) even against an oblivious adversary, where u is the 
capacity of an edge. 
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